M208 


Pure mathematics 


Book F 
Analysis 2 


The Open 
University 


This publication forms part of an Open University module. Details of this and other Open University modules 
can be obtained from Student Recruitment, The Open University, PO Box 197, Milton Keynes MK7 6BJ, 
United Kingdom (tel. +44 (0)300 303 5303; email general-enquiries@open.ac.uk). 


Alternatively, you may visit the Open University website at www.open.ac.uk where you can learn more about 
the wide range of modules and packs offered at all levels by The Open University. 


The Open University, Walton Hall, Milton Keynes, MK7 6AA. 
First published 2018. 
Copyright © 2018 The Open University 


All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, transmitted or 
utilised in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without 
written permission from the publisher or a licence from the Copyright Licensing Agency Ltd. Details of such 
licences (for reprographic reproduction) may be obtained from the Copyright Licensing Agency Ltd, Barnard’s 
Inn, 86 Fetter Lane, London EC4A 1EN (website www.cla.co.uk). 


Open University materials may also be made available in electronic formats for use by students of the 
University. All rights, including copyright and related rights and database rights, in electronic materials and 
their contents are owned by or licensed to The Open University, or otherwise used by The Open University as 
permitted by applicable law. 


In using electronic materials and their contents you agree that your use will be solely for the purposes of 
following an Open University course of study or otherwise as licensed by The Open University or its assigns. 


Except as permitted above you undertake not to copy, store in any medium (including electronic storage or use 
in a website), distribute, transmit or retransmit, broadcast, modify or show in public such electronic materials in 
whole or in part without the prior written consent of The Open University or in accordance with the Copyright, 
Designs and Patents Act 1988. 


Edited, designed and typeset by The Open University, using PTẸEX. 
Printed in the United Kingdom by Hobbs the Printers Limited, Brunel Road, Totton, Hampshire, S040 3WX. 


ISBN 978 1 4730 2349 9 
2.1 


Contents 


Unit F1 Limits 


Introduction to Book F 


Introduction 


1 


Limits of functions 

1.1 What is a limit of a function? 
1.2 Limits and continuity 

1.3 Rules for limits 

1.4 One-sided limits 


2 Asymptotic behaviour of functions 

2.1 Functions which tend to infinity 

2.2 Behaviour as x tends to infinity 
3 Continuity: the classical definition 

3.1 The ¢-6 definition of continuity 

3.2 Continuity of some unusual functions 

3.3 Limits and other asymptotic behaviour 
4 Uniform continuity 

4.1 What is uniform continuity? 

4.2 A condition that ensures uniform continuity 
Summary 


Learning outcomes 


Solutions to exercises 


Unit F2 Differentiation 


Introduction 


1 


2 Rules for differentiation 

2.1 Combination Rules 

2.2 Composition Rule 

2.3 Inverse Function Rule 
3 Rolle’s Theorem 

3.1 Local Extreme Value Theorem 

3.2 Rolle’s Theorem 
4 Mean Value Theorem 

4.1 Mean Value Theorem 

4.2 Positive, negative and zero derivatives 
5 L’H6pital’s Rule 

5.1 Cauchy’s Mean Value Theorem 

5.2 L’Hopital’s Rule and its application 
Summary 


Differentiable functions 

1.1 What is differentiability? 

1.2 One-sided derivatives 

1.3 Continuity and differentiability 


Learning outcomes 


Table of standard derivatives 


Solutions to exercises 


57 


59 


59 
59 
67 
70 


75 
75 
80 
83 


89 
89 
92 


95 
95 
98 


103 
103 
106 


110 


111 


112 


113 


Unit F3 Integration 


Introduction 


1 


The Riemann integral 

1.1 Definition of the integral 

1.2 Criteria for integrability 

1.3 Properties of integrals 

1.4 Proofs of Theorem F44 and Theorem F46 (optional) 


2 Evaluation of integrals 
2.1 The Fundamental Theorem of Calculus 
2.2 Primitives 
2.3 Techniques of integration 
3 Inequalities, sequences and series 
3.1 Inequalities for integrals 
3.2 Wallis’ Formula 
3.3 The Integral Test 
4 Stirling’s Formula 
4.1 Comparing functions of n 
4.2 Calculating factorials 
4.3 Proof of Stirling’s Formula (optional) 
Summary 


Learning outcomes 


Table of standard primitives 


Solutions to exercises 


119 


121 


121 
123 
140 
145 
149 


153 
153 
156 
158 


167 
168 
172 
174 


178 
179 
180 
183 


186 


187 


188 


189 


Unit F4 Power series 199 


Introduction 201 
1 Taylor polynomials 201 
1.1 What are Taylor polynomials? 201 
1.2 Approximation by Taylor polynomials 206 
2 Taylor’s Theorem 210 
2.1 Taylor’s Theorem 210 
2.2 Taylor series 217 
3 Convergence of power series 220 
3.1 Radius of convergence 221 
3.2 Proof of the Radius of Convergence Theorem (optional) 228 
4 Manipulating Taylor series 231 
4.1 The Combination Rules and the Power Rule 232 
4.2 The Differentiation and Integration Rules 237 
4.3. The General Binomial Theorem and the Uniqueness Theorem 242 
5 Numerical estimates for m 246 
5.1 Tangent formulas 247 
5.2 Proof that 7 is irrational (optional) 249 
Summary 251 
Learning outcomes 251 
Table of standard Taylor series 252 
Solutions to exercises 253 
Acknowledgements 259 


Index 261 


Unit F1 
Limits 


Introduction 


Introduction to Book F 


In this book we put the foundations of calculus on a firm logical basis. At 
the end of Book D Analysis 1 you met the idea of a continuous function, 
and at the start of this book you will meet the related ideas of a limit of a 
function and of uniform continuity. Then we use these ideas to study 
differentiation and integration in detail. Finally, we discuss the 
representation of functions by power series. 


You will meet many applications of these ideas, including: 
e a technique for proving inequalities such as 
log(1 +x) >x-— $z’, for x € (0,00) 


a function which is continuous but nowhere differentiable 


e Stirling’s approximate formula for n! 
e several remarkable exact formulas for 7, including Wallis’ Product, 


ee Cee 2n r) 


si ey E . 
Sel 3 3 3 57 m-l m4l 
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e an elegant proof that m is irrational. 


Introduction 


In this unit you will meet the concept of a limit of a real function, which is 
closely related to the idea of a continuous function. Roughly speaking, a 
real function f has a limit at a point c if it is either continuous at c, or if it 
is defined near c and we can assign a value to f(c) that makes the function 
continuous at c. You will also study various types of asymptotic behaviour 
of functions — that is, their behaviour when the domain variable or the 
codomain variable becomes arbitrarily large. For example, you will see 
that if n € N, then z”/e” > 0 as x > ow. 


In the second half of the unit you will return to the topic of continuity and 
study an alternative definition, the so-called ¢-d definition of continuity. 
This is equivalent to the definition based on sequences that you studied in 
Unit D4 Continuity and, although it may appear to be more abstract, it is 
easier to use in certain situations. You will meet several unusual functions 
and see how the two definitions of continuity can be used to investigate at 
which points they are continuous or discontinuous. You may be surprised 
at the results! 


Finally, you will meet the concept of uniform continuity. This is a stronger 
form of continuity, defined using the ¢-6 approach, and it will play an 
important role when you study the integration of continuous functions. 


Many of the results in Sections 1 and 2 are analogues of results on 
sequences and continuity covered in previous units, so we omit their proofs. 


Unit F1 Limits 


1 Limits of functions 


In this section you will study the concept of a limit of a real function (that 
is, a function whose domain and codomain are subsets of R). The notion of 
a limit is of fundamental importance to differentiation, which you will 
study in the next unit, so you should make sure you have a good 
understanding of this material. 


1.1 What is a limit of a function? 


Sometimes we need to understand the behaviour of a function that is 
defined near a particular point, but not at the point itself. For example, 
consider the function 


f(a) = 


(This function arises when we prove that the sine function is differentiable, 
as you will see in Unit F2 Differentiation.) The graph of f in Figure 1 
suggests that as x gets closer and closer to 0, f(x) takes values which are 
closer and closer to 1. 


sin £ 


(z € R- {0}). 


sin x 


Figure 1 The graph of y = 


On the other hand, consider the function 
1 
g(x) = sin 7 (x € R — {0}). 


In this case, when x takes values close to 0, the values taken by g(a) do 
not lie close to any single real number: as you can see in Figure 2, g is 
highly oscillatory near 0. 


1 Limits of functions 


1 
Figure 2 The graph of y = sin — 
x 


We say that the function f has a limit as x tends to 0, but the function g 
does not. We now make this concept precise. 


First we introduce the idea of a punctured neighbourhood of a point c. 
This is simply a bounded open interval with midpoint c, from which the 


point c itself has been removed. We use the notation N,(c) for a punctured N,(c) 
neighbourhood of length 2r with centre c, so a 
c-r G CFT 


N. = (c — U h 0 
r(e) = (c= r,c) U (cpc +r), where r > 0, Figure 3 The punctured 


as illustrated in Figure 3. For example Nj(3) = (2,3) U (3,4). neighbourhood N,.(c) 


Definition 
Let f be a function defined on a punctured neighbourhood N,.(c) of c. 
Then f(x) tends to the limit l as x tends to c if l € R and 


for each sequence (zn) in N,(c) such that £n > c, 
f(tn) > l. 
In this case, we write 


lim f(x) =1 or f(x) >lasx—--c. 


Remarks 

1. The definition applies whether or not the function f is defined at the 
point c and, if it is defined there, irrespective of the value of f(c). 

2. Note that the limit lim f(x), if it exists, does not depend on which 

zc 

punctured neighbourhood of c is considered (that is, it does not depend 
on r). It is also important to note that the limit must be the same for 
every possible sequence (£n). 


Unit F1 Limits 


Figure 4 The graph of 
y= Va 


3. The above definition does not allow us to state that lim Vx exists, 
a 


because the domain [0,00) of f(x) = yz does not contain any 
punctured neighbourhood of 0. Later in this section we introduce the 
idea of a one-sided limit, and see that f(x) = yx has limit 0 as x tends 
to 0 from the right. The graph of this function is shown in Figure 4. 


4. Because this definition of a limit involves sequences, we often use results 
about sequences to determine whether a function has a limit at a 
particular point. 


We now use this definition to prove that (sinz)/xz — 1 as x — 0, as we 
guessed earlier. 


Theorem F1 
hess sın gt 


r>0 T 


= 


Proof ®. To prove this result we first use two trigonometric inequalities 
to establish upper and lower bounds for (sin x)/x on a punctured 
neighbourhood of 0. This then enables us to apply the Squeeze Rule for 
convergent sequences from Subsection 3.3 of Unit D2 Sequences. & 


First note that the function x | (sin x)/zx is defined on every punctured 
neighbourhood of 0. 
We now use the inequality 
sing <x, for0< g< 7/2, 
(proved in Subsection 2.3 of Unit D4) to deduce that 
sin x 


<1, tor <a < a2: (1) 


g 
Next we require the inequality 
x < tang, for0< z< T/2, (2) 


which follows by comparing the area of a sector of a disc of radius 1 
(shown in Figure 5(a)) with that of a i right-angled triangle which 
contains the sector (shown in Figure 5(b 


à lA 


shaded area = ir shaded area = itang 
(a) (b) 


Figure 5 (a) A sector of a disc (b) A triangle containing the sector 


®. Recall that a sector of angle 6 in a disc of radius r has area Or. B 


Since cos x > 0 for 0 < x < 1/2, we deduce from inequality (2) that 
cosg < a for 0 < z < T/2. 
ii 


®, Remember that a real function f is even if f(x) = f(—2x) for x ER. & 


Thus, by inequality (1) and the fact that the functions z —> cos x and 
x +> (sinx)/x are both even, 


cosg < a <1, ford < |x| < 7/2. (3) 
a 


®. We have now established lower and upper bounds for (sin x)/z in the 
punctured neighbourhood N,,/2(0). ® 


Now suppose that (xn) is any null sequence in the punctured 
neighbourhood N,,/9(0). Then 


sin £ 
cose, < — < 1, forn=1,2,..., (4) 
Tn 
by inequalities (3). Since £n — 0, we have cos £n + 1, because the cosine 
function is continuous at 0 and cos 0 = 1. Hence, by inequalities (4) and 
the Squeeze Rule for sequences, 
sin x 
—"* 31 asn > œ, 
Tn 


as required. E 


The limit in Theorem F1 was quite tricky to establish, but usually there 
are simpler ways to find limits. For example, we can determine many 
limits of functions by using the Combination Rules for sequences which 
you met in Unit D2. 


Worked Exercise F1 


Prove that each of the following functions tends to a limit as x tends to 2, 
and determine these limits. 


r?—4 _ a — 34-2 
— g2 — 3442 


(a) f=) Fw) 


1 Limits of functions 


Unit F1 Limits 


Thus if (xn) is any sequence in R — {2} such that £n —> 2, then 
f(@n) = tn +2 —>2+2=4 ano, 


by the Sum Rule for sequences. Hence 


(b) Since a — 3w + 2 = (x= 2)(a — 1), the domain of f is R= {1,2}. 


®. Because f is not defined at 1, the largest punctured 
neighbourhood of 2 in which f is defined is Nj (2), illustrated 


below. 
N,(2 
Se a 
1 2 3 


Thus f is defined on Nı (2) and 


flo) ee —32—-2 (a—2)(a7+27+1) 2742241 
W O ee Se ee ee eee 
xr? — 3442 (x — 2)(a — 1) z—1 


for x € N,(2). Thus if (zn) lies in N1(2) and £n — 2, then 
oe 4+2yt1 44441 _ 


= —______ + —— =9 
f (an) a 1 Dil ? 
by the Combination Rules for sequences. Hence 
oc — 30 —2 


im $< = 
zr? ¢2 — 34 +2 


Later in this section you will meet further techniques for finding limits. 
First, however, we give a strategy for proving that a limit does not exist. 


Strategy F1 


Let f be a real function defined on a punctured neighbourhood N,.(c) 
of c. 


To show that lim f(x) does not exist, either 
Te 


) 
e find two sequences (£n) and (yn) in N, (c) which tend to c, such 
that (f(£n)) and (f(yn)) have different limits, or 


e find a sequence (£n) in N,(c) which tends to c such that 
f (an) > œ or f(a) > —o0. 


Worked Exercise F2 


Prove that each of the following functions does not tend to a limit as x 
tends to 0. 


(a) f(w)=sin(1/z) (b) f(a) = 1/2 


Solution 
(a) ©. The graph of f is shown below. 


Yr 
1 


N 
Xy 


The graph suggests that we should be able to use the first part of 
Strategy F1, with one sequence of points whose images under f 
are equal to 1 and one sequence of points whose images under f 
are equal to —1. & 


The function f(x) = sin(1/x) has domain R — {0}. To prove that 
f(x) does not tend to a limit as x tends to 0, we choose two null 
sequences (zn) and (yn) in R — {0} such that 


f(tn) +1 whereas f(y) > —1. 
To do this we use the facts that 


sin(2na+7/2)=1 and sin(Qn7+3a/2)=-1, fornez. 


It follows that if we choose 


1 4 il 
= ———— an = — 
2na + 1/2 an 2nm + 30/2’ 


then tz, > 0, y, — 0 and, for n = 1,2,..., 
Mer =sn(l/g,)—1 and) f(y) —sml/y,) = 
So 


In 


nn 


f(a@n) 31 asn—>co and f(yn) >—-1 asn> œ. 


Hence f(x) = sin(1/z) does not tend to a limit as x tends to 0. 


1 Limits of functions 
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y 
1] y=f(x) 


Figure 6 The graph of a 
function which takes the 
value 1 except at x = 0. 


10 


Sy 


Here are some limits of functions for you to consider. Recall that |] is the 
integer part of x. 


Exercise F1 


Determine whether each of the following limits exists, and evaluate those 
limits which do exist. 


2 
Qin == (tai © lim Jog |z| 


x2—0 T al 


1.2 Limits and continuity 


Consider the function 
Le 22520, 
GG — 
f(a) P x=0, 


whose graph is shown in Figure 6. Does this function tend to a limit as x 
tends to 0 and, if so, what is the limit? Well, if (xn) is any null sequence 
with non-zero terms, then 


fiw) = 1h. form = 12 kay 
so 

f(n) 41 as n —> o. 
Hence 

lim f(x) = 1. 


This example illustrates the fact that the value of a limit lim f(a) is not 
zc 

affected by the value of f(c), if f happens to be defined at c. 

However, the following theorem shows that if f is defined and continuous 


at c, then the value of the limit must be f(c), and the converse statement 
is also true. 


1 Limits of functions 


Theorem F2 


Let f be a function defined on an open interval J, with c € J. Then 
f is continuous at c 

if and only if 
lim f(r) = f(e). 


The proof of Theorem F2 uses the fact that, in this situation, the 

definition of continuity of f at c, from Subsection 2.1 of Unit D4, is almost 

identical to the definition of the existence of lim f(x), with this limit equal 
wc 


to f(c). The only difference is that, in the former case, we allow the terms 
of the sequences (zn) which appear in the definition to equal c. We omit 
the details of this proof. 


Theorem F2 makes it easy to calculate many limits of continuous 
functions. For example, to determine 


lim (32° — 5a + 1), 

x—2 
we use the fact that the function f(x) = 32° — 5x? + 1 is continuous on R, 
since f is a polynomial. Hence, by Theorem F2, 


lim (32° — 5a? + 1) = f(2) = 77. 
r—2 


As a further example, you saw in Worked Exercise D48 in Unit D4 that 
the function 


2 . 
xf sin(1/x), x #0, 
f=) = 
0, m= 
is continuous at 0. Thus, by Theorem F2, 
lim z°sin(1/x) = 0. 
«z—0 
On the other hand, we saw in Worked Exercise F2(a) that 
lim sin(1/x) does not exist. 
«z—0 


It follows from Theorem F2 that, no matter how we try to extend the 
domain of the function f(x) = sin(1/x) to include x = 0, we can never 
obtain a continuous function. 


Exercise F2 


Use Theorem F2 to determine the following limits. 


(a) lim yz (b) lim Vsina (c) lim à 


È 
a—>2 xn /2 s>l l +r 
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Unit F1 Limits 


In the remainder of this unit we use Theorem F2 often, but we do not 
always refer to it explicitly. 


1.3 Rules for limits 


As you might expect from your experience with sequences, series and 
continuous functions, limits of functions can often be found by using 
various rules. First we state the Combination Rules for limits. These can 
be deduced from the corresponding rules for sequences; we omit the details. 


Theorem F3 Combination Rules for limits 
If lim f(z) = 1 and lim g(x) = m, then: 
Le Te 
Sum Rule lim (f(x) +g(x))=l+m 
Multiple Rule lim Af (x) = Al, for XE R 
LC 
Product Rule lim f(x)g(x) =lm 
LCE 
Quotient Rule lim f(z)/g(x) =1/m, provided that m # 0. 


For example, since 


. sine 
lim 


z0 T 


=1 and lim(z?+1)=1, 
x—>0 


we have, by the Combination Rules, 


lim (= +2(02+1)) =142x1=3. 
x—0 x 


Next we discuss the composition of limits. For example, consider the 
behaviour of 

sin(x?) 
2 Ej 


E 
as x tends to 0. This function can be written in the form 


sin u 
, where u = x°. 


u 
Now 


sin u 
u=2?—>0 asx—0 and 


>lasu->0, 


which suggests that 


sin(x?) 


5 >lasx—-0. 
x 


To justify this composition of limits, we use the following Composition 
Rule. This can be proved using properties of convergent sequences, limits 
and continuous functions that you have already met, but we do not give 
the details here. 
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Theorem F4 Composition Rule for limits 
if lime yee = U and lini g( 7) = JL, then 
LC rol 


lim g(f(x)) = L, 


Ae 
provided that 
either f(x) #1, for all x in some N,(c), where r > 0, 


or g is defined at l and continuous at J. 


Remarks 


1. When using this rule, it is important to remember that the limit for g is 
as x — l and not as x > c. This is perhaps easier to see if we rewrite 
the theorem as follows: 


If 

f(z) + lasxz—-c 
and 

g(t) 9 Lasz >l, 
then 

g(f(z)) > Lasx-e. 


2. Before you use the Composition Rule, you should check that one of the 
two provisos to Theorem F4 holds. (Sometimes both provisos will hold, 
but you only ever need to check that one does.) 


Perhaps surprisingly, the Composition Rule is false if we omit both of the 
provisos. For example, if 


T Jz TA, 
f(@)=1 and TE 


then 


lim f(x)=1 and lim g(z)= 2 
a1 


z—0 
so the Composition Rule would give lim g(f(x)) = 2. However, because 
r> 


f(x) = 1 for all values of x and g(1) = 0, we actually have 
lim g(f(2)) = 0 #2. 
x—>0 


For this example, neither of the provisos to Theorem F4 holds, because 
there is no punctured neighbourhood of 0 in which f(x) #1, and g is not 
continuous at 1. 


However, in nearly all cases in practice and in all the examples you will 
meet in this module, at least one of the provisos holds (though you should 
check this). 


1 Limits of functions 
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Unit F1 Limits 


This leads to the following strategy for using the Composition Rule. 


Strategy F2 


To use the Composition Rule to evaluate a limit of a function of the 
form g(f(x)) as x > c, do the following. 


1. Substitute u = f(x) and show that, for some J, 
u= f(x)—> l as z >c. 

2. Show that, for some L, 
glu)> L asu>l. 


3. Check that one of the provisos holds. 
4. Deduce that 


g(f(z)) > L asz >c. 


The following worked exercise illustrates how to apply this strategy. 


Worked Exercise F3 


Determine the following limits. 


: 1 : 2 
(a) lim eae) by, Tie (: 4 (=) 
x2—0 zT z—0 WH 
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® Notice that the second proviso does not hold in this case, 
since g is undefined at 0. However, we only need one of the 
provisos to hold to use the Composition Rule. .© 


Thus, by the Composition Rule, 


e as x — 0. 


g(F(2)) 


1 
zt 


(b) We can write 


PEER si : 
Substituting u = f(x) = sass we obtain 
z 


sin x 
Q= > 1 asz- o0, 
E 


C0 =e ea = iti eu l, 


since g is continuous at 1, which also tells us that the second 
proviso to the Composition Rule holds. 


®. Alternatively you could have noted that the first proviso 
holds, since f(x) 4 1 for x € Nı(0), for example. & 


Thus, by the Composition Rule, 


2 
sre) =1+ (=) 32 asz — o0. 


Exercise F3 


Use the Combination Rules and the Composition Rule to determine the 
following limits. 


sin £ . sin(sin x) x \1/2 
S. ometa omi) 
(a) a+0 2a + a2 (b) 240 sing se 20 \sing 
1— 
(d) lim cos x 
x—>0 x 


Hint: In part (d), use the identity cosx = 1 — 2sin? (4x). 


1 Limits of functions 
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Unit F1 Limits 


C=F C c+r 


Figure 7 ‘The Squeeze Rule 
for limits 
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There is also a Squeeze Rule for limits, analogous to the Squeeze Rules for 
sequences and continuous functions, whose proof we omit. This is 
illustrated in Figure 7. 


Theorem F5 Squeeze Rule for limits 
Let f, g and h be functions defined on N,(c), for some r > 0. If 
(al oe f(a) hor). tor ae N,(e} 
(b) lim g(x) = lim h(a) = 1, 
then 
liaa f(a) = l 


T= 


In the proof of Theorem F1, we showed that lim (sin x)/x = 1, using the 
inequalities A 


sin x 
cosz < — < 1, for0< |z| <7/2. 
g 


This was, in essence, an application of the Squeeze Rule for limits, with 

f(z) = (sinz)/z, g(x) = cosx and h(x) = 1. Since lim cosx = 1, the result 
r—> 

follows. 


In the next exercise the Squeeze Rule is used to establish another 
important limit. 


Exercise F4 
(a) Use the inequalities 
1 
l+a<e* < —., for |z| <1, 
1-2 


(proved in Corollary D49 in Unit D4) to show that 
x 
—1 
Bl gO oie E for 0 < |æ| < 1. 
Le x 1-2 


(b) Deduce from part (a) that 


le 


The limit found in Exercise F4 is one of the basic limits we often use. Here 
we record three such limits for future reference. The first was proved in 
Theorem F1, the second in Exercise F3(d) and the third in Exercise F4(b). 


(a) lim ei 
w IE 
ES 
O bm === = 
20) T 
T 
= 
(c) lim z = 
z—0 5B 


Earlier we mentioned that lim yz is not defined because the function 
2 


f(a) = Va is not defined on any punctured neighbourhood of 0. However, 
this function does tend to 0 as x tends to 0 from the right. 


Let f be a function defined on (c,c +r), for some r > 0. Then f(x) 
tends to the limit l as x tends to c from the right if 


for each sequence (zn) in (c,c +r) such that £n > c, 
f(an) > L 
In this case, we write 
lim f@)=1 or f(z) ol asx oc’. 
act 
There is a similar definition for a limit as a tends to c from the 
left, in which (c,c+ r) is replaced by (c — r,c). In this case, we write 


lim f(z)=l or f(#)>l ar>. 
Cc Cm 


We also refer to 


lim, f(x) and lim f(z) 


Larne cm 


as right and left limits, respectively. 


Sometimes both right and left limits exist but are different, as you will see 
in the next worked exercise. 


1 Limits of functions 
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Worked Exercise F4 


Prove that the following function tends to different limits as x tends to 0 
from the right and from the left. 


f= (x € R— {0}) 


Solution 


®. The graph of the function is shown below. 


y = 2/2 
joe 


= 2 
am 


Ry 


o 


The function f is defined on (0,1) and f(x) = 1 on this open interval. 


®. We could take any open interval of the form (0,r) with r > 0 
here. & 


Thus if (xn) is a null sequence in (0,1), then 
lining a — a = 
NCO n— o0 


Hence lim f(x) =1. 


x—> 0t 


Similarly, f is defined on (—1,0) and f(x) = —1 on this interval. 


®. We could take any open interval of the form (—r,0) with r > 0 
here. & 


Thus if (xn) is a null sequence in (—1,0), then 


lim f(g) = lim -1=-1. 
n—-Co n— o0 
Hence lim f(z) = —1. 
Ba) 


Since —1 Æ 1, the limits of f(x) as x tends to 0 from the right and 
from the left are different. 


The relationship between one-sided limits and ordinary limits is given by 
the following result, whose proof we omit. 
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Theorem F7 
Let the function f be defined on N,(c), for some r > 0. Then 


lim f(x) =1 
if and only if 
lim, a= im jax) = l 


TE cS Cm 


Analogues of the Combination Rules, the Composition Rule (and 
Strategy F2) and the Squeeze Rule can also be used to determine one-sided 


limits. In the statements of these rules, we simply replace lim by lim or 
zL—->C e—oct 


lim , and replace N,(c) by (c,c +r) or (c—1,c), as appropriate. Also, 


ge 

Strategy F1 can be adapted to show that a one-sided limit does not exist; 
the sequences (xn) and (yn) must be chosen to tend to c from the right or 
from the left, as appropriate. 


There is also a version of Theorem F2 for one-sided limits, as follows. 


Theorem F8 


Let f be a function whose domain is an interval J with a finite 
left-hand endpoint c that lies in J. Then 


f is continuous at c 
if and only if 
lim, f(x) = f(c). 


x—ct 


Remarks 


1. In Theorem F8, the interval J can have any of the forms [c, co), [c, b) or 
[c, b], where b > c. 

2. There is an analogous result to Theorem F8 for left limits. 

It follows from Theorem F8 that lim yg = 0, as claimed earlier, since 


xz—0+ 
f(a) = Vaz has domain [0, 00) and is continuous at 0. 


Exercise F5 


Prove the following. 


1 Limits of functions 
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Figure 8 A function for 
which f(x) tends to infinity as 
x tends to c 
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2 Asymptotic behaviour of functions 


In our discussion of graph sketching in Unit A4 Real functions, graphs and 
conics, we described several types of asymptotic behaviour (that is, 
behaviour of a function when the domain variable or codomain variable 
becomes arbitrarily large), such as: 


1 + x 
->œ asxr->0 and e” > œ as £> oœ. 
T 


In this section we define such statements formally and describe various 
relationships between them. 


2.1 Functions which tend to infinity 


In Section 1 we defined f(x) > las x — c for a finite limit / in terms of 
the behaviour of sequences. We can define 


f(z) + œ asa@-c 


in a similar way. This is illustrated in Figure 8. 


Definition 
Let the function f be defined on N,(c), for some r > 0. Then f(z) 
tends to oo as « tends to c if 
for each sequence (zn) in N,(c) such that £n > c, 
f(an) > œ. 


In this case, we write 


f(z) œ asx. 


Remarks 


1. 


In this module we do not use the notation lim f(a) = oo as this can give 
T>e 


the misleading impression that infinity can be treated in the same way 
as a finite limit. Algebraic manipulations of expressions involving oo are 
a common error in false proofs — as you saw, for example, for a series 
that is not convergent in Worked Exercise D29 in Unit D3 Series. 


. The statements 


f(z) + -co asz >c, 
f(x) — œ (or —o0) as z —> ct (orc), 


are defined similarly, with oo replaced by —oo and N,(c) replaced by 
the open interval (c,c +r) or (c — r,c), where r > 0, as appropriate. 


There is a version of the Reciprocal Rule which relates functions that tend 
to infinity and functions that tend to 0. (You met the Reciprocal Rule for 
sequences in Subsection 4.3 of Unit D2.) 


2 Asymptotic behaviour of functions 


Theorem F9 Reciprocal Rule for limits 
If the function f satisfies the conditions 

1. f(x) > 0 for x € N,(c), for some r > 0 

2. f(x) 9 0 as z >c, 


then 


1/f(z) 3 œ as z >c. 


For example, 


L/22 
[xz > œ as z > 0, Figure 9 The graph of 


because f(x) = z? > 0 for x € R — {0}, and lim x? = 0; see Figure 9. y = l/r? 
r> 


The Reciprocal Rule can also be applied with £x > c replaced by x — ct or 
xz —c_, and N,(c) replaced by (c,c +r) or (c — r,c), as appropriate. For ys 
example, we have 


1/4 œ as z > 0”, 


because f(x) = x > 0 for x € (0,00), and lim, x = 0; see Figure 10. 


x0 


Exercise F6 1- 
Prove that 
1 i > 
(a) Gee (b) Z 0 as z+ 0 1 T 
£ x 
1 P Figure 10 The graph of 
(c) zz? asa > l. joie 


There are also versions of the Combination Rules and the Squeeze Rule for 
functions which tend to co (or —oo) as x tends to c, ct or c~. Here we 
state the Combination Rules for functions which tend to oo as x tends to c. 


Theorem F10 Combination Rules for functions which 
tend to infinity 

If f(z) > co as x > cand g(x) > œ as x > c, then: 

Sum Rule f(z) + g(x) > œ as x > c 

Multiple Rule Af(x)— œ as x —> c, for A € Rt 

Product Rule f(x)g(x)— œ as z > c. 


These rules are analogous to the corresponding rules for sequences which 
tend to infinity; see Subsection 4.3 of Unit D2. 
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Figure 11 A function for 
which f(x) — las £x > co 
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2.2 Behaviour as x tends to infinity 


Next, we define various types of behaviour of real functions f(a) as x + oo 
or as x — —oo. To avoid repetition, in the following definition we allow the 
letter l to denote either a real number or one of the symbols co or —oo. 
The definition is illustrated in Figure 11. 


Definition 
Let the function f be defined on (R,0o), for some real number R. 
Then f(x) tends to l as x tends to oo if 


for each sequence (zn) in (R, o0) such that £n > 00, 
f(an) 3 1. 
In this case, we write 


f(z) 91 asx oœ. 


The statement 
f(z) 31 as z > —00 


is defined similarly, with co replaced by —oo, and (R,0o) replaced 
by (—co, R). Note that 


f(x) 31 as x 4 —00 
is equivalent to 
f(—x)—> l as z> oo. 


When / is a real number, we also use the notations 
Jim f(x) =! and Jim fH. 


Once again, we can use versions of the Reciprocal Rule and the 
Combination Rules to obtain results about the behaviour of given 
functions as x — oo or —oo. The new versions of these rules are obtained 
from the Reciprocal Rule and the Combination Rules in Subsection 2.1 
and the Combination Rules in Subsection 1.3 by replacing c by oo or —oo, 
and N,(c) by (R,0oo) or (—oo, R), as appropriate. 

Many results about the behaviour of functions f(x) as x tends to oo 

or —oco are derived from the following two basic facts, often by using the 
Combination Rules and the Reciprocal Rule. 


2 Asymptotic behaviour of functions 


Theorem F11 Basic asymptotic behaviour 
If n € N, then 


(a) 2” +00 as £t > CO 


1 
(by > 0'as 2 = co. 
T 


We can use Theorem F11, together with the Combination Rules and the 
Reciprocal Rule, to determine the asymptotic behaviour of various 
functions defined by quotients. This is similar to determining the 
behaviour of sequences defined by quotients (see Subsection 3.2 of 

Unit D2), and we give a corresponding definition of the dominant term of 
a quotient that suits the present context. 


Definition 


The dominant term of a quotient involving the real variable x is the 
term in x (without its coefficient) which eventually has the largest 
absolute value. 


For example, consider the behaviour of z/(x? +1) as x > oo. Here the 
dominant term is x”, so we divide both the numerator and the 
denominator by x? to give 
x 1/x 0 
= CT > —— 
xr? +1 1+1/r? 140 
by Theorem F11(b) and the Combination Rules. 


= 0 as z > œ, 


Exercise F7 


Prove that: 
2r’ 2x3 +1 
(a) lim a (b) sae — 00 aS T —> 00. 
z= 00 T x 


There are also versions of the Squeeze Rule for functions as x tends to 
infinity, which have some important applications. You met the 
corresponding versions of the Squeeze Rule for sequences in 

Theorems D10 and D18 in Sections 3 and 4 of Unit D2. 
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Theorem F12 Squeeze Rule for functions as x —> co 
Let f, g and h be functions defined on some interval (R, oo). 
(a) If f, g and h satisfy the conditions 

la alo) fa) = ha), for e E (Reo) 

2. im GCE) = in Ga 

where / is a real number, then 

jm, Ta) — 

(b) If f and g satisfy the conditions 

Lf) = ale) tor w E (Roo) 

2. g(x) 4 œ as E> CO 


then 


f(z) 3 œ as zt > ow. 


Exercise F8 


Use the Squeeze Rule to determine the behaviour of the following function 


as £ — 00. 
sin(1/x) 


f(a) = ŻE 
Hint: Use the fact that —1 < sin(1/z) < 1 for x £0. 


In Subsection 1.3 we gave the Composition Rule for limits and Strategy F2 
for using it. We can also use this strategy to deduce the asymptotic 
behaviour of composites of functions which have any of the types of 
asymptotic behaviour introduced in this unit, provided that we allow the 
letters l and L to denote either a real number or one of the symbols oo 

or —oo. Notice that if l is either oo or —co, then the first proviso of the 
Composition Rule is automatically satisfied, so there is no need to check 
this. 


For example, consider the asymptotic behaviour of amie as © > Oo. 
x 
We can write 
sin(1/x? 
#2) = g(f(0)), 
x 


where f(x) = x? and g(x) = m/s) 


. Substituting u = f(x) = 27, we 


u = f(t) =x? > 00 asx—oo (by Theorem F11), 


a= sin(1/u) 


ao 0 asu—oo (by Exercise F8). 


2 Asymptotic behaviour of functions 


Thus we deduce by Strategy F2 that 


ata) = PC) 0 as © > o0. 


In the next theorem, we collect together several standard results about the 


behaviour of particular functions as x > oo. 


Theorem F13 
(a) If ao,ai1,...,@n—1 E€ R, where n € N, and 


p(x) = 2" Fandt 1+---+a12% + a0, 


then 
1 
p(z) +00 asx—oo and —~—>0 asz -> o. 
p(x) 
(b) For each n = 0,1,2,..., we have 
x m 


L 
— >œ astro and — —0 axrz-oo. 
oe ez 


(c) We have 


log — co as £ > œ, 


but, for each constant a > 0, we have y Y5 e yr 


l s= 
SCLER) as © — oo. 


ge 


Part (b) tells us that, as x tends to infinity, e” tends to infinity faster than 
any positive integer power of x, as illustrated in Figure 12. On the other r - 
hand, part (c) tells us that, as x tends to infinity, log x tends to infinity 1 

more slowly than any positive power of x. Thus, in part (b) the dominant Figure 12 The graphs of 
term is e”, whereas in part (c) it is x. 


Proof of Theorem F13 


y =e, y =x andy=z 


(a) We use the fact that all the zeros of the polynomial p must lie in the 


interval (~M, M), where M = 1 + max{|an_1|,..-.,|a1|,|a@o|}, and 
p(x) >0, for xe(M,oœ). (5) 
®. This was proved in Theorem D54 of Unit D4. .@ 
Now for « £0, 
An-1 ao 
= 1 F 2) , 
p(z) a" (1+ T E 


By Theorem F11(b) and the Combination Rules, 


iae A ger paap as £ — OO. (6) 
£ x 
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Thus, for x € (M, oo), we have 
1 1/x” 0 
— = — M > -=0 a8 ron, 
plz) 1+an1/2+::: +ao/x” 1 


by statement (6), Theorem F11(b) and the Quotient Rule for limits. 
We deduce, by inequality (5) and the Reciprocal Rule, that 


p(z) 4 œ as t > œ. 


Let n be a fixed non-negative integer. We use the series representation 


2 x” ght 


y 
T 
=j e a a ied 
° eh a val ee” , 


which was shown to be valid for x > 0 in Subsection 4.1 of Unit D3. 
Since x > 0, all the terms in the above series are non-negative, so 


grt 
e€ > ——_., forr>0. 
(n+ 1)! 
Hence, for x > 0, 
x n ! 
Lo oo and ge o 
xr” ~ (n+1)! e7 £ 
It follows by Theorem F11, the Multiple Rule and the Squeeze Rule 
that 
x n 


x 
— >œ asxr—->oo and — —0 arom. 
GH e7 


It was shown in Subsection 4.2 of Unit D4 that the function 
xt+— log z is a strictly increasing inverse of the exponential function, 
with domain (0,00) and range R. We deduce that 


logz — œ as £z > o. 


Now let a be any positive constant. Since x* = exp(alog x), we make 
the substitution t = a log x, so that x! = et. For x > 0, this gives 
logr t/a t 


(7) 


xe eb act 
Since a > 0, we have 


t=alogr 4 œ astro, (8) 
and, by part (b) with n = 1 and the Multiple Rule, 


t 
aa as t + oo. (9) 


Hence, by statements (7), (8) and (9), together with the Composition 
Rule, we have 


log x t 
an +0 as z> oœ, 
xe aet 
as required. E 


3 Continuity 


Exercise F9 


Use the results of Theorem F13 and appropriate rules to determine the 
behaviour of the following functions as 7 — oo. 
3a? log x 2e" — x 
T 


@) Ja) = 5+ (b) fa) = © fe)= 


z? loggr e 
Hint: In part (b), express (log x)/e” in terms of (log x)/x and z/e”. 
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Exercise F10 


Prove that: 
(a) e® Jx? + 00 as £ > œœ (b) log(log x) > œo as x > co 


(c) xzsin(1/z) > 1 as £z > o. 


Hint: In part (c), use the substitution u = 1/z. 


3 Continuity: the classical definition 


In Book D you met a definition of continuity based on sequences, and saw 
that most familiar functions are continuous on their domains, a fact which 
is not at all surprising. However, there are many functions of interest for 
which it is more difficult to establish continuity (or discontinuity). 
Consider, for instance, the Riemann function, which has domain R and 
rule 


1/q, if x is a rational p/q, where q > 0, 
f(z) = FE 
0, if x is irrational. 


Note that, in this section, we assume that all rationals p/q are expressed in 
lowest terms; that is, the greatest common factor of p and q is 1. 

It follows from the definition that, for example, f(2/3) = 1/3 and 

f (V2) =0. By plotting the values of f(a) at different values of x we can 
produce a sketch of the graph of the Riemann function as shown in 

Figure 13. 


YA 
14 


EN“ ASAA ARAN AnA TA ADAN Ze 


Figure 13 A sketch of the graph of the Riemann function 


: the classical definition 
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From this sketch of the graph of f it is not clear whether the Riemann 
function is continuous at any point of R, but you will see (in 

Subsection 3.2) that in fact it is continuous at infinitely many points and 
discontinuous at infinitely many points! When dealing with such unusual 
functions, it is useful to have available the alternative definition of 
continuity which is introduced in Subsection 3.1. This definition looks 
more abstract but is more effective in some cases. 


The emergence of rigorous analysis 


The classical definition of continuity described in this section emerged 
towards the end of the nineteenth century after many years of debate 
amongst mathematicians about the rigorous formulation of analysis as 
the foundation of calculus. At this time, the informal approach used 
in the eighteenth century, for instance by Euler, was increasingly 
found to be inadequate. For example, around 1820, Joseph Fourier 
(1768-1830) used functions defined by infinite series to solve problems 
in the theory of heat. The properties of these series raised challenging 
questions about the meaning of convergence. This led to questions 
about the definitions of continuity, limits, differentiation and 
integration, and even the nature of the real numbers. These questions 
were not properly resolved until about 1870, after contributions by 
many mathematicians, including Bolzano, Cauchy, Riemann, 
Dirichlet, Dedekind, Weierstrass and Cantor. 


3.1 The e-ô definition of continuity 


The sequential definition of continuity that you studied in Unit D4 states 
that the function f : A —> R is continuous at c, where c € A, if 


for each sequence (zn) in A such that £n > c, 
f(n) > Fe). 


This definition uses sequences to formalise the intuitive idea that f(x) 
approaches f(c) as x approaches the point c in any manner. 


The new definition that we study in this unit formalises this idea in a 
somewhat different way, which we can describe in words as follows: 


we can make f(x) as close as we wish to f(c) by ensuring that x 
is close enough to c. 


The ‘closeness’ in this description is measured by two variables, € in the 
codomain and 6 in the domain, which represent ‘small’ positive numbers. 


3 Continuity: the classical definition 


Definition 
Let the function f have domain A and let c € A. Then f is 
continuous at c if 


for each £ > 0, there exists ô > 0 such that 
|f(z) — f(o)| <e, for all x € A with |x —c| < ô. (10) 


Remarks 


l; 


The above definition is quite subtle. It can be expressed in words as 
follows: no matter how small a positive number € we are given, we can 
choose a positive number 6 such that if the distance between x and c is 
less than 6, then the distance between f(x) and f(c) is less than e. 
Thus statement (10) can be interpreted as an implication: 


if x € A and |x — c| < ô, then |f(x) — f(c)| < €. 


. Note that 


|f(x) — f(o)|<e is equivalent to f(c)— e< f(x) < f(c) +6, 
lr—cel <6 is equivalent to c—ô< x< c+ ð, 


as illustrated in Figure 14. This shows that as x gets closer to c, f(x) 
gets closer to f(c). 


Figure 14 The £-ô definition of continuity 


3. 


Usually the value of 6 that we choose in order to make statement (10) 
true depends on the given value of e: the smaller £ is, the smaller 6 has 
to be. The value of 6 often depends also on the particular point c at 
which we are checking continuity. 
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We can interpret the task of finding a suitable choice of ô, when using this 
definition, as an ‘e-d game’ in which player A chooses a small positive 
number £ and then challenges player B to find a suitably small positive 
number 6 for which statement (10) is true. (This is like the ‘e-N game’ for 
null sequences that you met in Unit D2.) For example, suppose that 

f(x) = x? and c= 0. If player A chooses £ = 1/4 then player B can choose 
ô = 1/2 (or any smaller value) because if 


|x — 0| = |z| < 1/2, 
then 

f(x) — f(0)| = |z°| < 1/4. 
This is illustrated in Figure 15. 


ENV 
—B wins- 
UNIS 


Figure 15 The s-ô game for f(x) = x? at c=0 
In practice, we do not usually choose specific values for € when proving 
continuity, but this game illustrates the ideas involved. 


The method of applying the ¢-d definition of continuity depends on the 
nature of the function f. As an illustration, we apply the definition to 
polynomial functions, using the following strategy. 


Strategy F3 


To use the ¢-6 definition to prove that a polynomial function f with 
domain A is continuous at a point c € A, let € > 0 be given and carry 
out the following steps. 


1. Use algebraic manipulation to express the difference f(x) — f(c) as 
a product of the form (x — c)g(z). 


2. Obtain an upper bound of the form |g(x)| < M, for |a—c| < r, 
where r > 0 is chosen so that |c — r,c +r] C A. (The Triangle 
Inequality is often useful here.) 


3. Use the fact that | f(x) — f(c)| < M|x —cl, for |x — c| < r, to 
choose 6 > 0 such that 


|f(x) — f(o)| <£, for all x € A with |z —c| < ô. 


The following worked exercise shows how Strategy F3 can be used. 
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3 Continuity: the classical definition 


Worked Exercise F5 


Use the £c- definition to prove that f(a) = x? is continuous at c = 2. 


Solution 
The domain of f is R. 
Let £ > 0 be given. We want to choose ô > 0, in terms of £, such that 
| f(a) — f(2)| <e, for all x with |x — 2| < 6. (x) 
®. We follow the steps in Strategy F3. ® 
1. First we write 
f(x) =- FQ) = 1-4 = (z -2e +2), 


®. Writing f in this form will help us to show that if |x — 2| is 
small, then | f(x) — f(2)| is small. @ 


2. Next we obtain an upper bound for |x + 2| when z is near 2. 


®. We consider points for which |x — 2| < r with r = 1; any r > 0 
is suitable, but the resulting bounds will depend on r. ® 


If |x — 2| < 1, then z lies in the closed interval [1,3], so 

|z +2| < |æ| +2 (by the Triangle Inequality) 
SO 
3. Hence 

| f(x) — f(2)| < 5ļx — 2|, for ja —2| <1. 

So if |x — 2| < 6, where 0 < 6 < 1, then 
|f(@) — F(2)| < 56. 

Now 506 < € if and only if 6 < te. 


®. We now choose ô so that both of the inequalities 0 < 6 < 1 and 
O20 že are satisfied. & 


Thus, if we choose 6 = min{1, te}, then 
|f(x) — f(2)| <58 <5 x te =e, for all x with |x — 2| < ô, 
which proves statement (x). 


Thus f is continuous at the point 2. 


Exercise F11 


Use the e-6 definition to prove that f(a) = x? is continuous at c = 1. 


Hint: Note that z? — 1 = (x — 1)(x? + x + 1). 
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Next we verify that the two definitions of continuity are equivalent. 


Theorem F14 


The £-ô definition and the sequential definition of continuity are 
equivalent. 


Proof Let the function f have domain A, with c € A. First we assume 
that f is continuous at c according to the ¢-6 definition. We want to 
deduce that, 


for each sequence (zn) in A such that £n > c, 


f(tn) > Fle). (11) 
Let £ > 0 be given. Then, by the ¢-6 definition of continuity, there exists 
ô > 0 such that 


f(x) -—f(o)|<e, forall zE A with |r—c| < ô. (12) 
Since £n > c, there exists an integer N such that 

In—c| <6, foraln >N. 

Hence, by statement (12), 

f(tn) -—f(o)|<e, foraln >N. 


Thus statement (11) does indeed hold, so the sequential definition follows 
from the ¢-6 definition. 


Next suppose that f is continuous at c according to the sequential 
definition. We want to deduce that if € > 0 is given, then there exists ô > 0 
such that statement (12) holds. 


@. We use a proof by contradiction. .& 


Suppose that, for some € > 0, there is no such ô > 0. Then statement (12) 
must be false with 6 = 1, 6 $, ô $, and so on. Hence, for each n € N, 
there exists £n € A with |x, — c| < 1/n such that 


[f (En) — F(e)] > €. (13) 


Now, the sequence (zn) lies in A and £n — c. Thus, by the sequential 
definition of continuity, we have lim f(z,) = f(c), which contradicts 
n— o0 


inequality (13). We deduce that the ¢-d definition of continuity follows 
from the sequential definition. E 


3.2 Continuity of some unusual functions 


It is natural to ask which is the ‘better’ definition of continuity. It is 
difficult to give a definitive answer, but on the whole: 


e when proving the continuity of simpler functions the sequential 
definition is usually easier, whereas the ¢-6 definition can work better 
with more complicated functions 


e when proving discontinuity the sequential definition is usually easier. 


3 Continuity: the classical definition 


For most of the functions you have met so far in this module, the points 
where the functions are continuous have been ‘obvious’. However, there are 
many functions for which it is far less clear where they are continuous, if 
anywhere. In this subsection you will meet several interesting but quite 
complicated functions, and you will see that the ¢-6 definition is an 
effective means of proving continuity, even when it is not possible to use 
Strategy F3. 


The proofs in this subsection will take some effort to understand fully, so 
you may prefer to skim through them on a first reading and return to them 
as time permits. Do not be discouraged if you find them rather hard at 
first: reading proofs in analysis gets easier as you become more familiar 
with the sorts of arguments used. 


YA 
The Dirichlet function and the Riemann function y=1, rEQ 
The first function we consider has a simple definition, but is highly A Most 
discontinuous. The Dirichlet function has domain R and rule y = f(z) 
1, if x is rational, y=0, EQ 
f(z) = Seca ees 
0, if x is irrational. z 


The graph of f in Figure 16 looks rather like two parallel lines, but each Figure 16 The Dirichlet 
line has infinitely many ‘holes’ in it! function 


Theorem F15 


The Dirichlet function is discontinuous at every point of R. 


Proof Let c be any point of R. We show that f is discontinuous at c by 
using the sequential definition of continuity. 


®. We do this by showing that there are sequences that tend to c whose 

images under f tend to different limits. Recall the Density Property of R 
from Subsection 1.4 of Unit D1 Numbers: between any two real numbers, 
we can find both a rational number and an irrational number. ® 


By the Density Property of R, each open interval of the form 
(c—1/n,c+1/n), wheren €N, 


contains a rational x, and an irrational yn. Considering the sequences (£n) 
and (yn), we have £n > c and yn > c by the Squeeze Rule for sequences, 
but 


fär and -Flynn = 0; forn=1, 2 ss: 
Since (f(£n)) and (f(yn)) have different limits, f is discontinuous at c. E 
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Our next function shows even stranger behaviour. The Riemann function, 
which we introduced at the start of this section, has domain R and rule 


1/q, if xis a rational p/q, where q > 0, 
f(z) = ae 
0, if x is irrational. 


(Recall that, in this section, p/q is always expressed in its lowest terms.) 


YA 


EN ea SAR SN AMOR a feo ne As 


Figure 17 A sketch of the Riemann function 


As mentioned earlier, it is not clear from the sketch of the graph of f in 
Figure 17 whether the Riemann function is continuous at any point of R. 
In fact, it has the remarkable property that each open interval of R 
contains infinitely many points where f is continuous and infinitely many 
points where f is discontinuous. As you study the proof, notice how it 
illustrates the strengths of the two definitions of continuity. 


Theorem F16 


The Riemann function is discontinuous at each rational point of R 
and continuous at each irrational point. 


Proof Here we prove discontinuity using the sequential definition and we 
prove continuity using the ¢-6 definition. 


First we prove that f is discontinuous at rational points. 


®. Here we use Strategy D14 from Unit D4; that is, for each rational 
point c, we find a sequence that converges to c but whose images under f 
do not converge to f(c). ® 


Let c = p/q, with q > 0 (where p/q is expressed in lowest terms). Then, by 
the Density Property of R, each open interval of the form 


(c—1/n,c+1/n), where n €N, 


contains an irrational number zn. Considering the sequence (£n), we have 
Ln > cand f(£n) = 0, for n = 1,2,.... Since f(c) = 1/q #0, we have 
f(an) > f(c), so f is discontinuous at c. 


®, Recall that the notation - is read as ‘does not tend to’. & 
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3 Continuity: the classical definition 


Next we prove that f is continuous at irrational points. Let c be an 
irrational number in R. We must prove that 


for each € > 0, there exists 6 > 0 such that 
|f(z) — f(o)| <e, for all x with |z — c| < ô. (14) 
Let £ > 0 be given. Since c is irrational, we have f(c) = 0. Also, f(x) > 0 
for all x in R, so statement (14) can be rewritten as 
f(a) <e, for all x with |z — c| < ô. (15) 
®. Note that Strategy F3 cannot be used here, since f is not a polynomial 
function. & 


To obtain a value of ô such that statement (15) holds, we first choose a 
positive integer N such that 1/N < e. Then we let Sy denote the set of 
rationals p/q in the interval (c — 1,c +1) such that 0 < q < N. There are 
only finitely many elements of the set Sy, and c é Sy because c is 
irrational. Thus the number 


6 = min{|x — c| : x € Sy} 
exists and is positive. Therefore the open interval (c — 6,c+ 0) contains no 
rationals p/q with0 <q < N. 
Hence if |x — c| < 6, then 
either «x is irrational, so f(x) =0 < e€, 
or x = p/q with q > N, so f(x) =1/q < 1/N <e. 


In either case f(x) < £, so we have succeeded in choosing 6 > 0 such that 
statement (15) holds. Hence f is continuous at c. a 


In view of the strange properties of the Riemann function, it is natural to 
ask whether a function can also be found which is continuous at each 
rational point of R and discontinuous at each irrational point. It can be 
shown that no such function exists, but we do not prove this here. 


The blancmange function 


Our next function is in some ways even more unusual than the Dirichlet 
and the Riemann functions. To construct this function, we start with the 
sawtooth function illustrated in Figure 18: 


tafe if0<a—-|a| <3, 


1—(e-|z]), if}<2-|2| <1, 


Figure 18 ‘The sawtooth 
where |x] is the integer part function. function 


The blancmange function B is obtained by forming an infinite series of 
functions related to s: 


B(x) = s(x) + 55(22) + ts(4x) + $5(82) ai 2. 
`> Z 9(2"2) 


n=0 
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The properties of this function were first studied by the Japanese 
mathematician Teiji Takagi (1875-1960) in 1903. The name 
‘blancmange function’ was used by the English mathematician 
David Tall (1941-) in the 1980s. 


For example, to evaluate B(t) we find the sum of the corresponding series: 


B(z) = 8(3) + 98(5) + 38(1) + g8(2) + 
=4+4x}i+ix0+4 x04 


NI= Ble 


In this case, the series has only finitely many non-zero terms, but for 
some x the series for B(x) has infinitely many non-zero terms. However, 
since 0 < s(x) < 5, for x € R, we have 


1 1 
0< pn s(2" 2) < ar forz € RandneN, 
so the series defining B(x) is convergent for each x € R, by the 
Comparison Test for series; see Subsection 2.1 of Unit D3. 


To picture the graph of the blancmange function, we consider the graphs 
of several successive partial sum functions of the series for B, with domains 
restricted to [0,1]; see Figure 19. (In each case the graph of the previous 
partial sum function is in light dashes and the function being added is in 
heavy dashes. Thus the solid line is the sum of the graphs shown by 
dashed lines.) 


YA 
y = s(x) 


NI= 
fi 


Figure 19 Successive steps in the construction of the blancmange function 


3 Continuity: the classical definition 


The sum function B has the graph shown in Figure 20. 


7 
Hy 


a 
MN 
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Figure 20 The blancmange function 


The graph of B is very irregular, in the sense that it oscillates rapidly up 
and down, and does not appear to be smooth at any point. In fact, in 
Unit F2 you will see that the function B is nowhere differentiable! 
However, it does seem that the function B is continuous, and we can show 
that this is true. 


Theorem F17 


The blancmange function is continuous. 


Proof ®. We use the e-6 definition of continuity. & 
Let c € R. We want to show that 
for each € > 0, there exists 6 > 0 such that 
|B(x) — B(c)| <£, for all z with |æ — c| < ô. (16) 
Let £ > 0 be given. We first write 


Hence, by the infinite form of the Triangle Inequality (see Theorem D35 in 
Subsection 3.1 of Unit D3), 


oe) 


|B(x) — B(c)| < ` zz [s(2”x) — s(2”c)]|. (17) 


For all x and c, and n = 0,1,2,..., both s(2”x) and s(2”c) lie in the 
interval [0, 3], so 


|s(2"x) — s(2"c)| < 4, forn=0,1,2,.... (18) 
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Now we choose an integer N such that 1/2" < $e and consider the ‘tail’ of 
the series in inequality (17), starting from the term n = N. 


®. Such an N exists because (1/2”) is a basic null sequence. Splitting the 
series into two different parts will enable us to use different methods for 
each part. We bound each part by le to give an overall bound of c. ® 


By inequality (18), we have 


z a il 
D lsh) — 8(2"¢)| < 5 > a 
n=N n=N 
if 12% l oi 
es = — < łe. 19 
s(a) gN < 2° a 
>E 
®. Here we have used the fact that > on is a geometric series with 
n=N 
[0,6] 
i. 1⁄2 
= 1/2" andr = 1/2; — = . 
gee" and ¢=1/ °°) on 1-172 


Next we consider the rest of this series. Each of the functions 

gr s(2"r), n=0,1,...,N—1, 
is continuous. 
®. We omit the proof of this — you may like to check it for yourself. & 
Therefore, for each n = 0,1,...,N — 1, there is a positive number ôn such 
that 

|s(2"x) — s(2"c)| < fe, for all x with |x — c| < ôn. 
®. You will see a bit later in the argument why it makes sense to choose 
fe here. @ 
Thus if 6 = min{ ôo, 61,...,dn—1} and |x — c| < 6, then 


N-1 N-1 
1 


» sals(2"2) — s(2"c)| < > Ji (te) < 2x te= le. 


co 
1 
®. Here we have used the fact that jp — =2, %@ 
QN 
n=0 
Combining this inequality with inequalities (19) and (17), we obtain 
statement (16) with this choice of 6. Hence B is continuous at the 
point c. E 


The blancmange function is very irregular, but it exhibits patterns known 
as ‘self-similarity’. However closely you look at the graph, you can see 
‘mini-blancmanges’ growing on it everywhere. The existence of these 
mini-blancmanges can be explained by rewriting the series defining B: 


B(x) = s(x) + $5(22) + łs(4x) + +5(82) hee 
= s(x) + (s(2z) + 55(4ar) + 75(82) Jai -) 


3 Continuity: the classical definition 


The graph of the function x +> 3B (2x) is just the graph of B scaled by 
the factor i in both z- and y-directions. Hence the graph of B is the graph 
of s with a (sheared) 4-size blancmange growing on each sloping line 
segment. (You met shears in Unit C3 Linear Transformations.) Smaller 
mini-blancmanges can be explained in a similar manner. This is illustrated 


in Figure 21. 


1 


q-size blancmange 


sheared $-size blancmange 


ni 
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Figure 21 The self-similarity of the blancmange function 


Such irregular sets, which display self-similarity, are studied in detail in the 
subject known as ‘fractals’. 


3.3 Limits and other asymptotic behaviour 


In Sections 1 and 2 we defined limits and other types of asymptotic 
behaviour using a sequential approach. Each of these concepts can also be 
defined in a way that is analogous to the £-ô definition of continuity. For 
example, we can define the concept of a limit as follows. 


Definition 
Let f be a function defined on a punctured neighbourhood N,.(c) of c. 
Then f(x) tends to the limit l as x tends to c if 


for each £ > 0, there exists 6 > 0 such that 
|f(z) -I]<e, for all x with 0 < |x —c| <0. 


As before, we write 


lim f(a) =1 or f(r) 731 asx. 
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Remarks 


1. This definition is very similar to the £-ô definition of continuity, except 


that f(c) is replaced by l and |x — c| < ô is replaced by 0 < |x —c| < ô. 
This reflects the fact that when we try to find the limit of f(x) as x 
tends to c, the value of f(x) at x = c is not relevant and indeed may not 
be defined. 


2. The proof that the above definition is equivalent to the sequential 


definition of a limit is similar to the proof of Theorem F14. 


3. Analogous definitions can be constructed for one-sided limits and for all 


the types of asymptotic behaviour you have met. 
For example, lim f(x) =1 if 
T00 
for each £ > 0, there exists K € R such that 
|f(z)-U<e, foral z with z > K. 


A brief history of the limit concept 


Recent historical studies have shown that, contrary to what was 
previously believed, Isaac Newton (1642-1727) had a good grasp of 
the limit concept, recognising that limits provide a secure foundation 
for calculus. Jean le Rond d’Alembert (1717-1783) in his article in the 
French Encyclopédie (published between 1751 and 1765) provided a 
definition of limit very close to that of Newton: a bound that could be 
approached as closely as one chose. However, because d’Alembert, like 
Newton, worked with examples that were primarily geometric, there 
was no need to consider quantities that might oscillate from one side 
of a limit to the other. 


In 1821 Augustin-Louis Cauchy (1789-1857), in his Cours d’Analyse, 
provided a definition of a limit which combined the same ideas as that 
of d’Alembert: the existence of a fixed value and the possibility of 
approaching it as closely as one wishes. Although Cauchy was the 
first to use €-d arguments in his proofs, he never gave an explicit ¢-d 
definition of a limit. Unknown to Cauchy, in 1817 the Bohemian 
theologian and mathematician Bernard Bolzano (1781-1848) had 
already introduced a rigorous £-ô definition of a limit, but his work 
was not well known and had only indirect influence on later 
developments. The e-6 definition used today was first formalised by 
the German mathematician Karl Weierstrass (1815-1897) in his 
lectures in Berlin in the 1860s. 


The next worked exercise gives an example of a limit evaluated using the 
é-6 definition. (In fact, for this example, the sequential definition is easier 
to use; see Worked Exercise F1(a).) 


3 Continuity: the classical definition 


Worked Exercise F6 


Use the £-ô definition of a limit to evaluate 


Exercise F12 


Use the £-ô definition of a limit to evaluate 
_ 2234+ 32-5 
lim — 
al a—-l1 


Hint: Use the fact that 223 + 32 — 5 = (x — 1)(2x? + 22 + 5) and follow 
Strategy F3 for using the ¢-6 definition of continuity. 
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4 Uniform continuity 


In Section 3 you met an alternative approach to continuity, based on the 
e-6 definition. In this section you will see how this approach can be used to 
describe a stronger notion of continuity, which plays a key role in our later 
work on the integration of continuous functions in Unit F3 Integration. 


4.1 What is uniform continuity? 


The ¢-6 definition of continuity states that a function f is continuous at a 
point c in an interval J in the domain of f if 


for each £ > 0, there exists 6 > 0 such that 
\f(z) —f(o)|<e, forall xe with |z — c| < ô. 


In this definition we cannot expect that, for a given positive number e, the 
same positive number 6 will serve equally well for each point c in I. 
Sometimes, however, this does happen, in which case: 


for each € > 0, there exists 6 > 0 such that 
\f(z) — f(c)|<e, forall z,c ET with |x — c| < ô. 


Informally, for all c in the interval J, as x gets closer to c, the values f(z) 
get closer to f(c) at least as quickly as some uniform rate. We make the 
following definition. 


Definition 
A function f defined on an interval J is uniformly continuous on I 
if 
for each € > 0, there exists ô > 0 such that 
e — f(y)|<e, for all z,y € I with |x = <6. (20) 


Remarks 


1. In this definition we have used the variables x and y, rather than x 
and c, to indicate that these two variables are of equal standing. Notice 
that uniform continuity is defined on an interval: it is meaningless to 
say that a function is uniformly continuous at a point. 


2. We say that c is an interior point of an interval J if c is not an 
endpoint of J. It follows from the above definition that if f is uniformly 
continuous on an interval J, then f is continuous at each interior point 
of I. At an endpoint of J, the function can be discontinuous because of 
its behaviour outside I. 


3. If f is uniformly continuous on an interval J, then f is uniformly 
continuous on any subinterval of I. 


4 Uniform continuity 


Worked Exercise F7 


Prove from the definition that f(x) = 2? is uniformly continuous on 
I =[-4,4]. 


Solution 
Let £ > 0 be given. We have 
= AO) Ses a = ania) 
Hence, for x,y € [—4, 4], 
[f(x) — F) = |z + yl |e — yl 
< (|x| + |y|)|a —y| (by the Triangle Inequality) 
since |x| < 4 and |y| < 4. 


Thus, if we choose 6 = Ze, then whenever x,y € [—4, 4] and 
|x — y| < 6, we have 


|f(z) — f(y)| < 8lz — y| 
< 8x te =g. 
Hence f is uniformly continuous on |—4, 4]. 


®@. In Exercise F13(b) you will see that f(x) = x? is not uniformly 
continuous on R. It is, however, uniformly continuous on all bounded 
closed intervals, as you will see in Subsection 4.2. © 


In the next worked exercise we consider the function f(x) = 1/a. In 
Unit D4 you saw that this function is continuous on its domain, and hence 
n (0, 1]. We now show that it is not uniformly continuous on (0, 1]. 


Worked Exercise F8 


Prove that f(a) = 1/z is not uniformly continuous on I = (0, 1]. 


Solution 


®. To prove that a function is not uniformly continuous, we need to 
show that the negation of statement (20) holds. © 


We have to find £ > 0 such that, no matter which ô > 0 is chosen, 
there are points x and y in I with |z — y| < 6 and |f(zx) — f(y)| 2 e. 
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®. We sketch the graph of the function f. 


I il cf 


The graph suggests that, for any positive 6, we should take x and y 
near 0 because then x and y are close together but f(x) and f(y) can 
be far apart. & 


We try x = 56 and y = ô, where 0 < 6 < 1. Then 
|x — y| = |48 — ô| = 56 < ô 

and 

_ 2 


a >1 
Sree 


I5 ô 


Hence the negation of statement (20) holds with € = 1, so f is not 
uniformly continuous on I. 


The reasoning in this solution is quite subtle and can be tricky to apply, so 
it is useful to reformulate what it means to say that a function is not 
uniformly continuous on an interval. Roughly speaking, this happens if 
you can find pairs of points in the interval, as close together as you like, 
whose images are not close together. We prove the following result, 
illustrated in Figure 22. 


Theorem F18 


Let the function f be defined on an interval J. Then f is not 
uniformly continuous on T if and only if there exist two sequences (£n) 
and (yn) in J, and € > 0, such that 


1. |En — Yn| > 0 as n > œ 
D M m fe), tor w= anns 


| 
| 
| 
A 
-Y2 T2 Y1 Tı £ Proof ® We start by proving the only if statement. & 


Figure 22 A function which First suppose that f is not uniformly continuous on J. Then there exists 
is not uniformly continuous € > 0 such that for all 6 > 0 there are points x and y in I with 


lz—yl <6 and |f(x)— f(y)| e. 
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Applying this fact with ô = 1, 6 $, ô $, and so on, we obtain 


sequences (£n) and (yn) in I such that 
1 
|En — Yn| < 7 and |f(an)—f(yn)|>e, forn=1,2,.... 


Thus statements 1 and 2 both hold. 


On the other hand, suppose that there exist sequences (£n) and (yn) in J, 
and £ > 0, such that statements 1 and 2 hold. 


®. We use a proof by contradiction to show that f is not uniformly 
continuous on J. ® 


If f is uniformly continuous on J, then there exists 6 > 0 such that 
|f(z) — f(y)|<e, for all x,y € T with |x — y| < ô. 

But |£n — Yn| < ô, for n > N say, by statement 1, so 
If(@n) — F(yn)| <E, forn>N, 


contradicting statement 2. Thus f is not uniformly continuous on J. | 


Theorem F18 gives us the second part of the following strategy; the first 
part is an elaboration of the definition of uniform continuity. 


Strategy F4 


e To prove that a function f is uniformly continuous on an interval J, 
find an expression for ô > 0 in terms of a given € > 0 such that 


\f(z) — f(y)|<e, for all z,y € I with |x — y| <6. 


e To prove that a function f is not uniformly continuous on an 
interval I, find two sequences (£n) and (yn) in J, and € > 0, such 
that 


|En — Yn| 20 as n —> oo, and 
Cen te Nl es eae LOR 77 y Byasa 


When using part 2 of Strategy F4, you should aim to choose the terms x, 
and yn close together at points of J where the graph of f is steep; see 
Exercise F13(b), for example. 


We could have applied part 2 of Strategy F4 in Worked Exercise F8 by 
taking £n = 1/(2n) and yn = 1/n, for n = 1,2,..., since (£n) and (yn) lie 
in I: 


1 il 1 
|En — Yn| = |> — -| = — > 0 as n —> oo 
2n n 2n 
and, for n = 1,2,..., 
1 1 1 1 
seen) - sll- l hl e 
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Thus, by taking £ = 1 in part 2 of Strategy F4, we deduce that f is not 
uniformly continuous on I. 


Try the next exercise using Strategy F4. 


Exercise F13 


(a) Prove that f(x) = z? is uniformly continuous on I = [—2, 2]. 
(b) Prove that f(x) = x? is not uniformly continuous on J = R. 


Hint: In part (b), take £n = n + 1/n and y, =n, for n =1,2,.... 


4.2 A condition that ensures uniform 
continuity 


Checking uniform continuity from the definition can be complicated. 
However, we can often deduce uniform continuity in a straightforward way 
from the following fundamental result. Like the Intermediate Value 
Theorem and the Extreme Value Theorem that you met in Section 3 of 
Unit D4, this result is another illustration of the fact that continuous 
functions on bounded closed intervals have particularly nice properties. 


Theorem F19 


If the function f is continuous on a bounded closed interval [a,b], then 
f is uniformly continuous on fa, b]. 


You saw in Worked Exercise F7 that the function f(x) = x? is uniformly 
continuous on [—4, 4]. In fact this can be deduced immediately from 
Theorem F19, as follows. The function f(x) = x? is continuous on the 
whole of R, and hence on any bounded closed interval, so it must be 
uniformly continuous on any bounded closed interval, by Theorem F19. 
However, f(x) = x? is not uniformly continuous on the set R, as you saw 
in Exercise F13(b): the image values of points near to a point c approach 
f(c) more and more slowly as the point c gets larger and larger. 


Exercise F14 


Use Theorem F19 to prove that f(x) = x? is uniformly continuous on 
I = |-2, 2]. 


Proof of Theorem F19 We assume that f is continuous on [a,b] but 
not uniformly continuous on fa, b], and deduce a contradiction using the 
bisection method. 


®. You met the bisection method in the proofs of the Intermediate Value 
Theorem and the Extreme Value Theorem in Section 3 of Unit D4. & 


Since we are assuming that f is not uniformly continuous on [a,b], it 
follows from Theorem F18 that there exist sequences (xn) and (yn) in [a,b] 
and € > 0, such that 


|En — Yn| 20 as n —> oo, and (21) 
Iln =f Se tort = 1 Qyece. (22) 


Let ap = a, bb = b and p= Z (ao + bo). Then at least one of [ao, p] or [p, bo] 
must contain terms x, for infinitely many n € N. We denote this interval 
by [a1, b1], choosing either interval if both contain infinitely many 

terms zn. Thus we have: 


iL; (a1, bı] C ao, bo] 
2. bı — a, = $ (bo — ao) 
3. [a1, b1] contains terms 2, for infinitely many n € N. 


Now we repeat this process, bisecting [a1, bi] to obtain [a2, b2], and so on. 
This gives a sequence of closed intervals 


et, bal, 01,9. 226 
such that the following properties hold for k = 0,1,2,... 
1. [ap41, bk+1] C lax, bk] 
2. by — ap = (4)* (bo — ao) 
3. [az, bk] contains terms £n for infinitely many n € N. 
®. We use k here to avoid n having two meanings. & 


Now property 1 implies that (ax) is increasing and bounded above by bo. 
Hence (aj) is convergent by the Monotone Convergence Theorem (see 
Section 5 of Unit D2). 


Moreover, if we let jim ak = C, then it follows that im, by = c also, by 


property 2 and the ‘Combination Rules for Raie. d by property 3, 
the sequence (zn) contains a subsequence (£n,) such that 


Tn, E lak, bk], for k = 0,1, 2,0. 


Thus jim In, = C, by the Squeeze Rule for sequences, so 
— 00 


Yn, = (Yny, — Try) + 2n, > 0+c= 0, 
by statement (21). It now follows from the continuity of f at the point c 
that 

lim F (Eng) = f(e) and pm, F (Yn) = AOF 


k- oo 
but this contradicts statement (22). Hence our original assumption that f 


is continuous but not uniformly continuous on [a,b] must be false. This 
completes the proof. E 
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As part of this proof, we showed that any sequence (£n) in [a,b] contains a 
convergent subsequence (ap). This remarkable result, which is of 
importance in many parts of analysis, is called the Bolzano- Weierstrass 
Theorem. It can be stated as follows. 


Theorem F20 Bolzano—Weierstrass Theorem 


Any bounded sequence has a convergent subsequence. 


Summary 


In this unit you have studied the limiting behaviour of a real function f 
near a point c, using the concept of a punctured neighbourhood of a 
point c. You have seen that f(a) has limit f(c) as x tends to c precisely 
when f is continuous at the point c. You have also studied the limiting 
behaviour of f as x tends to infinity, and seen what it means for the 
function values f(x) to tend to infinity as x tends to c. You have learnt 
how to use a variety of rules to help you analyse such asymptotic 
behaviour and to evaluate different limits. 


You have also met a new definition of continuity: the ¢-d definition of 
continuity. You have seen that this is equivalent to the sequential 
definition of continuity that you studied in Unit D4 and, although it is 
more abstract than the sequential definition, it can be easier to use in 
certain situations. In particular, you have used both definitions to prove 
different properties of some interesting functions: the Dirichlet function, 
the Riemann function and the blancmange function. 


Finally you have learnt what it means for a function to be uniformly 
continuous on an interval. You have seen how to give a direct proof that a 
function has this property and met the result that a function that is 
continuous on a bounded closed interval is uniformly continuous. 


As you continue your studies, you will see that the ideas you have met in 
this unit play a key role in the foundations of calculus. In particular, limits 
are fundamental in the theories of differentiation and integration, and 
uniform continuity is used to show that a function that is continuous on a 
bounded closed interval can be integrated. 


Learning outcomes 


Learning outcomes 


After working through this unit, you should be able to: 


understand the statement lim f(x) = 1, or f(z) 3 las z >c 
te 


appreciate the relationship between limits of a function and continuity 


use the Combination Rules, Composition Rule and Squeeze Rule to 
calculate limits of functions 


understand one-sided limits, and the statements lim f(x) =l and 
xr—->C 


lim fie) =1 
ge 
use certain basic limits to find other limits 


understand the asymptotic behaviour of functions, and the statements 
f(z) — œ as x —> c (or c* or c) and f(x) — las x — oo (or —o0) 


use the Reciprocal Rule, the Combination Rules, the Squeeze Rule and 
the Composition Rule to determine the asymptotic behaviour of 
functions 


understand and use the e—ô definition of continuity 


describe functions which are highly discontinuous, or continuous but 
very irregular 


understand and use the c—ô definition of a limit 
understand the definition of uniform continuity and use it in simple cases 


know and use various conditions for uniform continuity. 


49 


Unit F1 Limits 
Solutions to exercises 


Solution to Exercise F1 
(a) The function 
r? +2 


x 


f(x) = 


has domain R — {0}, so f is defined on any 
punctured neighbourhood of 0. Also, 


f(a) = A0 


Thus if (xn) lies in R — {0} and £n > 0, then 


=ax+1, forxr #0. 


f(@n) =I@t+11. 


Hence 
2 
ee ae 
lim =|; 
x0 Y 


(b) The domain of f(x) = |x] is R, so f is defined 


on any punctured neighbourhood of 1. Now, 
f(x) = 0, 
and 
f(x) =1, 


The two sequences (1 + 1/n) and (1 — 1/n) both 
tend to 1, and have terms lying in R — {1}. Also, 


lim f(l+1/n)=1 but jim fl — 1/n) = 0. 


for0 <a <1, 


forl<a< 2. 


Hence lim |x| does not exist, by the first part of 
xr 
Strategy F1. 


(c) The function f(x) = log |x| has domain 

R — {0}, so it is defined on any punctured 
neighbourhood of 0, and its graph is as follows. 
(This is included for interest — you are not 
expected to have sketched the graph as part of 
your solution.) 


YA 
y = log |z| 
HH 
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We consider the null sequence zn = 1/n. Then 
f (an) = log |zn| = log(1/n) = — logn. 
Now logn — co as n > œ, so f(£n) 4 —0o 
as n —> Oo. 
Hence lim log |x| does not exist, by the second part 
of Strategy F1. 


Solution to Exercise F2 


(a) The domain of the continuous function 

f(x) = vz is the interval [0, co), so f is defined on 
the open interval (0,00), which contains 2. Hence, 
by Theorem F2, we have 


lim vz = V2. 
r2 


(b) Since sing > 0 for 0 < x < m, the function 
f(x) = vsin z is defined on the open interval 
(0,7), which contains 7/2, and is continuous, by 
the Composition Rule for continuous functions. 
Hence, by Theorem F2, we have 


a vsin z = ysin(t/2)= 1. 
rn /2 


(c) The function f(x) = e*/(1 + x) is defined on 
the open interval (—1,00), which contains 1, and is 
continuous, by the Quotient Rule for continuous 
functions. Hence, by Theorem F2, we have 

x 1 


Nile 


lim 


ide te 2 


Solution to Exercise F3 


(a) First we write 


sin x O sin x O sin x 1 
Qe+a2 a2(2+2) \ z 2+a) 


lim =1, (by Theorem F1) 
z>0 T 
and 
lim = $, 
r=>02 +r 
because is continuous at 0, we deduce by the 


Product Rule for limits that 
sin x 


lim = 
x0 27 + x? 


(b) We can write 


= g9(f(z)), 


— 1d 
=1x35 z 


sin(sin x) 
sin x 
sin x 


where f(x) = sing and g(x) = = 


Substituting u = f(x) = sin x and using the fact 
that u is continuous at 0, we have 


u=sinz—>sin0=0 as z — 0 
and 


sin u 
>+>lasu-0. 


g(u) = 


The first proviso to the Composition Rule holds, 
because f(x) = sing 4 0 in N,(0), for example. 


Thus, by the Composition Rule, 


g(f(2)) = 


(c) We can write 


5” = 9 F( 


sin(sin x 
sin(sin «) >laszx—-0. 
sin x 


£)); 


where f(x) = me and g(x) = 1/2'/?. 
Substituting u = f(x) = = we have 
— Z = 1 asz — o0, 
z 


glu) = 1/ul/? > 1/12 =1 asus, 


since g is continuous at 1, which also tells us that 
the second proviso to the Composition Rule holds. 


Thus, by the Composition Rule, 


oe) = (<— 
(d) Using the hint, we obtain 


x 


1/2 
) >laszx—0. 


1— cosg _ 2sin? (4x) 
— s 
(i 
= sn (a2) x sin ($2) . 
z 


Solutions to exercises 


by Worked Exercise F3(a), and 
lim sin (52) 0, 


by the continuity of the function £z > sin (52). 
Thus, by the Product Rule, 


Solution to Exercise F4 


(a) Since 
1 
1+a<e* < ——, for |a| <1, 
l-g 
we have 
C ee se for |z| < 1 
Thus if 0 < x < 1, then 
e”—1 1 y |x| 
1< < ne Sp E 
= a “ieg "Tae Tieg 
and if —1 < x < 0, then 
T 
yee ee 2 -1-2 
x l=% l= g l= 


Since we also have 


|z| 


<1 when0O<2<1 


i 
1 


and 
|z| 
1-2 


we deduce that the inequalities 


1+ >1 


when —1<2<0Q, 


T 
ool eh aga Ne: 


l-z x 


iP 
l-g 


hold whenever 0 < |x| < 1, as required. 
(b) We have 
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because both functions are continuous on the 
interval (—1, 1). 


Thus, by part (a) and the Squeeze Rule, 


x—>0 p 
Solution to Exercise F5 


(a) Since lim ZIT L 1 and the function z > yz 
z>0 g£ 


is continuous at 0, we have 


=1 (by Theorem F7) 


lim vyx =0 (by Theorem F8). 


a—0t 


Hence, by the Sum Rule, 


(b) We can write 
sin fz 
where f(x) = yz and g(x) = a 
Bp 
Substituting u = f(x) = yx, we have 


— + 
u= yz > 0 as z > 0”, 


glu) = ee ee eee 0, 

so that 
im = 4 (by Theorem F7). 
u0t 


Moreover, f(x) = yz #0 on the open interval 
(0,1), so the first proviso of the one-sided limit 
version of the Composition Rule holds. 


Thus, by the Composition Rule, 


sin yx 


g(f(x)) = —>1 asg — 0t. 


Va 
Solution to Exercise F6 


(a) Let f(x) = |z|; then f(x) > 0 for x € R — {0}, 
and 


lim |z| = 0, 
«2-0 
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since f is continuous at 0. Hence, by the 
Reciprocal Rule, 


1 1 
T = ial >œ asr—>0. 
(b) Let f(x) = z?/ sin z; then 
y 
F(z) = paas >0, frre Nz (0), 
and 
r? 0 
fla Gnas as x > 0, 


by Theorem F6(a) and the Quotient Rule, since 
the function x ++ x? is continuous at 0. 


Hence, by the Reciprocal Rule, 
1 sing 
f(z) 
(c) Let f(x) = 2° — 1; then f(z) > 0 for 
x € (1,00), and 


>œ as z — 0. 


lim 2? -—1=0, 
x—l+ 


since f is continuous at 1. Hence, by the 
Reciprocal Rule, 
1 1 A 
== > œ ast >l”. 


f(z) =i 


Solution to Exercise F7 


(a) Since 


2L? +r 1 
f(z) = +5, fora #0, 


we deduce, by Theorem F11(b) and the Sum Rule, 
that 


1 
f(z) =2+472+0=2 as z > o. 
z£ 


(b) Let 


f(x) = Ope 1’ 


then dividing both the numerator and the 
denominator by the dominant term x*, we obtain 


Lig 0 
fe) = sys > 340 
by Theorem F11(b) and the Combination Rules. 


=0 a rou, 


Hence, by the Reciprocal Rule, 
1 2a? +1 
f(z) 
Solution to Exercise F8 
Since —1 < sin(1/z) < 1 for x £0, we have 
sin(1/z) < 1 
£ x 
Also, by Theorem F11(b) and the Multiple Rule, 


—> co as TtT —> o0. 


1 
—-< for x € (0,00). 
x 


1 
g(x) =-- > 0 as z> œ 
£ 
and 
1 
h(a) =—-—>0 as z > oo. 
z 


Thus, by the Squeeze Rule, part (a), 


in(1 
s 9 as £ — œo. 
Ba 


f(z) 


Solution to Exercise F9 
(a) By Theorem F13(b), we have 


x 
zw ar OO. 
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By Theorem F13(c), we have 


log x 


5 >0 as zt — œ. 
x 


Also, (log x)/z? > 0 for x > 1, so 


r2 


TT —+ œ as T > 00, 

by the Reciprocal Rule. 

Hence, by the Combination Rules, 
e? 3x? 


x2 log x 


= 00 as T> O0. 


(b) Following the hint, we write 


logx  /logx x 
ec ( x ) ) ` 
By Theorem F13(b) and (c), 


log x 
DSt 46 as © — œo. 
x 


x 
=e asxz@—>oo and 
e 


Thus, by the Product Rule, 
log x 


ex 


>0x0=0 as z> œ. 


Solutions to exercises 


(c) The dominant term is e”, so we write 

2e” =r?  2-—g?/e 

e? +logx 14+ (logx)/e* 
Thus, by part (b) above, Theorem F13(b) and the 
Combination Rules, 

2e” — z? 2-0 


> =2 — o. 
e7 + log x 1+0 ee 


Solution to Exercise F10 


(a) We can write 


where f(x) = x? and g(x) = e”/z. 
Substituting u = f(x) = x7, we have (by 
Theorem F13(a) and (b)) 


u = r? > œ as T > OO, 


eu 


g(u) = — > œ as u > oo. 
u 


Thus, by the Composition Rule, 


x2 


g(f(2)) =< > 00 as £ = 00. 
(b) We can write 

log(log x) = g(f()), 
where f(x) = log x and g(x) = log x. 


Substituting u = f(x) = log x, we have (by 
Theorem F13(c)) 


u=logx 4 œ as © > œ, 


glu) = logu => œ as u > oo. 
Thus, by the Composition Rule, 
g(f(x)) = log(log x) + 00 as z > oo. 


(c) We can write 


vsin(1/2) = UP) — gf), 
where f(z) = and g(x) = me 


Substituting u = f(x) = 1/x, we have 


u =1/r—=>0 asz— oo (by Theorem F11(b)), 
_ sinu 


g(u) = 


>1lasu-0 


(by Theorem F1). 
u 


53 


Unit F1 Limits 


Thus, by the Composition Rule, 


g(f(x)) = zsin(1/xz)—> 1 as z > oo. 


Solution to Exercise F11 
The domain of f(x) = z? is R. 


Let £ > 0 be given. We want to choose ô > 0, in 
terms of £, such that 


|f@æ)- fA) < e, 
We follow the steps in Strategy F3. 


for all x with |z—1| < ô. (x) 


1. First we write 
f@)- F0) =23 -1=(e@-D(e? +241). 
2. Next we obtain an upper bound for |£? + 2 +1| 
when z is near 1. If |x —1| < 1, then z lies in the 
interval [0,2], so (by the Triangle Inequality) 
ln? +a +1] < |? + [ol +1 
ao $2 l= 7, 


3. Hence 
f(z) -—f(1)| <7|2-1], for |jt—1| <1. 
So if |z — 1| < 6, where 0 < 6 < 1, then 
f(x) — F| < 76. 
Thus, if we choose 6 = min{1, ze}, then 
f(a) — FO) < 76 < e, 


for all x with |x — 1| < ô, 
which proves statement (*). 


Thus f is continuous at the point 1. 


Solution to Exercise F12 
The domain of 


3 _ 
TE 22° +3x-— 5 


g= l 
is R — {1}, so f is defined on each punctured 
neighbourhood of 1. Also, for z Æ 1, 
Qe +3xr—5 (x— 1)(2x? + 2r + 5) 
7 7 (x — 1) 
= 2x7 + 2g +5. 


f(a) 


x—-1 


This suggests that 
lim f(z) =2x 17+2x1+5=9, 
z—1 
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so we must prove that 
for each € > 0, there exists 6 > 0 such that 
| f(x) — 9| < e€, for all x with 0 < |z — 1| < ô. (*) 
1. First we write, for x Æ 1, 
f(£)-9= 2r +2r+5-9 
= 2(x? + x — 2) 
=2(x — 1)(x + 2). 
2. Next, if |x — 1| < 1, then z lies in the interval 
[0,2], so (by the Triangle Inequality) 


2(x + 2)| < 2(|2| +2) < 2(2 + 2) = 8. 
3. Hence 
f(x)— 9| < 8ļx — 1|, for 0< |z-—1|< 1. 
So if 0 < |x — 1| < ô, where 0 < 6 < 1, then 
f(x) — 9| < 86. 
Thus, if we choose ô = min{1, ze}, then 
f(x) —9| < 86 < e, 


for all x with 0 < |x — 1| < ô, 
which proves statement (*). 


Hence 


Solution to Exercise F13 
(a) We use part 1 of Strategy F4. Let £ > 0 be 
given. We have 
f(e)- fg) =2? -y° 
= («—y)(2* +ay +’), 
so for x,y € [—2, 2] (by the Triangle Inequality), 
If(x) — FY) = |x - | la? +2y + y?| 
< (æl? + Jal ly] + lyl*)l@ — yl 
since |x| < 2 and |y| < 2. 


Thus, if we choose 6 = be, then whenever 


x,y € [—2,2] and |z — y| < 6, we have 
|f(x) — F(y)| < jz —y| < 12x He =e. 


Hence f is uniformly continuous on [—2, 2]. 


(b) Following part 2 of Strategy F4 and the hint, 
we take £n =n+1/n and yn =n, for n =1,2,.... 
Both sequences lie in J = R and 
[En — Yn| = |(n + 1/n) — n| 
=1/n->0 asn > oo, 


= (n+ 1/n}? - n? 


= n? + 2n(1/n) + (1/n)? — n? 


= 2+ 1/n? > 2, for n = 1,2)... 


Thus, by taking £ = 2 in part 2 of Strategy F4, we 
deduce that f is not uniformly continuous on R. 


Solution to Exercise F14 


Since f(x) = x? is continuous on its domain R, we 
deduce that f is continuous on the bounded closed 
interval |—2, 2]. Thus f is uniformly continuous on 
[—2, 2], by Theorem F19. 


Solutions to exercises 
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Unit F2 
Differentiation 


1 Differentiable functions 


Introduction 


In Unit D4 Continuity you studied continuous real functions and saw that 
they share some important properties; for example, they all satisfy the 
Intermediate Value Theorem, the Extreme Value Theorem and the 
Boundedness Theorem. However, many of the most interesting properties 
of functions are obtained only when we further restrict our attention to 
differentiable functions. 


You have already met the idea of differentiating a given real function f; 
that is, finding the gradient of the tangent to the graph y = f(x) at those 
points of the graph where a tangent exists. The gradient of the tangent at 
the point (c, f(c)) is called the derivative of f at c, and is written as f’(c). 
In this unit we investigate which real functions are differentiable, and we 
discuss some of the important properties that all differentiable functions 
possess. 


Many of the techniques of differentiation and the properties of 
differentiable functions covered in this unit will not be new to you, but our 
main focus here is on establishing a rigorous foundation for these ideas. 
We will make frequent use of results on limits and continuity that you met 
in previous analysis units. 


1 Differentiable functions 


In this section we define what it means for a real function f to be 
differentiable at a point c, and we establish that certain basic functions are 
differentiable and find their derivatives. We also consider functions which 
possess higher derivatives, that is, functions which can be differentiated 
more than once. Finally, we show that differentiable functions are 
continuous, and also prove that the blancmange function, which was shown 
to be continuous in Unit F1 Limits, is in fact not differentiable at any 
point in its domain. 


1.1 What is differentiability? 


Differentiability arises from the geometric concept of the tangent to a 
graph. The tangent to the graph y = f(x) at the point (c, f(c)), if it exists, 
is the line through the point (c, f(c)) whose direction is the ‘limiting 
direction’ of the chords joining the points (c, f(c)) and (x, f(a)) as z tends È T 
to c. This idea is illustrated in Figure 1. 


Figure 1 Constructing the 
tangent to a graph 
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The three examples shown in Figure 2 below illustrate some of the 
possibilities that can occur when we try to construct tangents in particular 
instances. 


v v Vy = k(2) 
Ce 
yY = |z] pi 
T 
gi 
= bee lg. > ey ae | 
x p x E 


(b) (c) 
Figure 2 Different possibilities that occur when trying to construct tangents 


(the rule for the function k is given in the text) 


The function 
f(a) =2? (zER) 


is continuous on R, and its graph (Figure 2(a)) has a tangent at each 
point; for example, the line y = 2x — 1 is the tangent to the graph at the 
point (1,1). 


The function 
g(x) =|z| (x ER) 


is also continuous on R, but its graph (Figure 2(b)) does not have a 
tangent at the point (0,0): no line through the point (0,0) is a tangent to 
the graph. This is because the ‘limiting direction’ of the chords described 
earlier is different depending on whether (0,0) is approached from the left 
or from the right. However, there is a tangent at every other point of the 
graph. 


Finally, the function 


—-1, «<0, 
k(z)= 4 0, «=0, 
iL aes, 


is discontinuous at 0, and no line through the point (0,0) is a tangent to 
the graph (see Figure 2(c)). However, there is a tangent at every other 
point of the graph. We now make these ideas precise, by using the concept 
of limit (which you studied in Unit F1) to pin down what we mean by 
‘limiting direction’. We define the gradient (or slope) of the graph at 

(c, f(c)) to be the limit, as x tends to c, of the gradient of the chord 
through the points (c, f(c)) and (x, f(x)). The gradient of this chord is 


f(a) - f(e) 


TtT—cC 


, where x £c, 


as illustrated in Figure 3. This expression is called the difference 
quotient for f at c. Thus the gradient of the graph of f at the point 
(c, f(c)) is 
ie ee (1) 
rac r~r—C 


provided that this limit exists. We call this limit the derivative of f at c. 


Sometimes it is more convenient to use an equivalent form of the difference 
quotient. If we replace x by c+ h, then ‘x > œ in expression (1) is 
equivalent to ‘h > 0’. The difference quotient for f at c is then 


Q(h) = as where h Æ 0, 


as illustrated in Figure 4, and the gradient of the graph of f at (c, f(c)), 
that is, the derivative of f at c, is given by 

li h 

lim Q(h), 
provided that this limit exists. 


To formalise this concept, we need to ensure that f is defined near the 
point c, so we assume that c lies in some open interval J in the domain of f. 


Definitions 


Let f be defined on an open interval J, and let c € J. Then the 
derivative of f at c is 
TORG 
MSE T—C 


that is, 


3 


: _ fle+h) - fle) 
ea Q(h), where Q(h) = ee 


? 


provided that this limit exists. If the limit exists, then we say that f 
is differentiable at c. If f is differentiable at each point of its 
domain, then we say that f is differentiable (on its domain). 


The derivative of f at c is denoted by f’(c) and the function 
f': c++ f'(x) is called the derivative, or sometimes the derived 
function, of f. 


The operation of obtaining f'(x) from f(x) is called differentiation. 


1 Differentiable functions 


QOs== 
8 
Sy 


Figure 3 The chord joining 
(c, f(©)) and (x, f(x) 


i 


Ny 


| 
| 
E c+h 


Figure 4 The chord joining 
(c, f(c)) and (c+ h, f(e + h)) 
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Remarks 


1. The word ‘differentiable’ arises because the definition involves the 
differences f(x) — f(c) and x — c. 


2. The above box defines the notation for derivatives that we will normally 
use in this module, but there are several others in common use. 
Sometimes f’ is denoted by Df and f'(x) is denoted by D f(x). 
Another frequently used notation is Leibniz notation, in which f’(z) is 

a where y = f(z). 

3. Note that we require f to be defined on an open interval containing c 
because, by the definition of the limit, x can approach c along any 
sequence of points in a punctured neighbourhood of c. In the next 
subsection we will define one-sided derivatives using the corresponding 
one-sided limits that you met in Unit F1. 


written as 


4. The existence of the derivative f’(c) is not quite equivalent to the 
existence of a tangent to the graph y = f(x) at the point (c, f(c)). If 
van FOOL) 
£—>c L—-—C 
point (c, f(c)), and the gradient of the tangent is the value of the limit. 
However, the converse is not necessarily true. The graph may have a 
vertical tangent at the point (c, f(c)), in which case 


f(x) — fF) 


r—C 


exists, then the graph certainly has a tangent at the 


— oo (or —co) as £ > c. 


fœ) =f 


L—C 


In this case, lim does not exist, so f is not differentiable 
zc 


at c. 


The idea of finding the gradient of a graph which is not a straight line 
was one of the first steps in the development of calculus. The major 
figures in this development were Sir Isaac Newton (1642-1727) in 
England and Gottfried Wilhelm Leibniz (1646-1716) in Germany, to 
whom we owe the ‘dy/dx’ notation and the names ‘differential 
calculus’ and ‘integral calculus’. The names ‘derived function’ and 
‘derivative’ and the notation f’ were introduced much later by 
Lagrange (1736-1813). Another mathematician who played a role in 
the early development of calculus was Pierre de Fermat (16017-1665) 
in France. 


Worked Exercise F9 


Prove that the function f(x) = x? is differentiable at each point c € R, and 
determine f’(c). 


We use c to denote a particular point where we are testing for 
differentiability. However, when stating the rule of a derivative, we replace 
c by the usual variable x. Thus the derivative of f in Worked Exercise F9 
is f'(x) = 327. 

To prove from the definition that a function is not differentiable at a point, 
we need to show that the limit of the difference quotient does not exist. 
The following strategy is based on Strategy F1 from Unit F1. 


Strategy F5 


To prove that a function is not differentiable at a point, show that 
ioe Q(h) does not exist by doing either of the following. 
—| 


e Find two null sequences (hn) and (kn) with non-zero terms such 
that the sequences (Q(hn)) and (Q(kn)) have different limits. 


e Find a null sequence (hn) with non-zero terms such that 
Q(hn) > œ or Q(hn) > —00. 


The next worked exercise illustrates how to apply this strategy. 
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Worked Exercise F10 


Prove that the function 


1 
xsin-, «40, 
z 


0, g =; 


f(x) = 
is not differentiable at 0. 


Solution 
@. The graph of f is shown below. 


The difference quotient for f at c= 0 is 


f(h) = f0) 
OO 
_ hsin(1/h) — 0 
p h 
®. Since sin(1/h) oscillates infinitely often in any interval containing 
the origin, giving different tangents at different points, we use the first 
method of Strategy F5. © 


=sin(1/h), where h #0. 


Consider the two sequences 


eee ariel o = : 


— =E 
nit C To 


Then (hn) and (kn) are null sequences with non-zero terms, chosen so 
that 
Q(hn) = sin(1/hn) = sin(nrt) = 0 > 0 as n —> 00 
and 
Q(kn) = sin(1/kn) = sin(2n + $)m = 1 > 1 as n > oo. 
Since these limits are different, f is not differentiable at 0. 


®. Other choices of (hn) and (kn) are possible; for example, 
il 


il 
hn = — and kn = i S Wy Booey 


2nt (2n — $) T 


giving Q(hn) = 0 and Q(kn) = —1. # 
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Worked Exercise F10 shows that the domain of a derivative f’ can be 
smaller than the domain of f. 


Exercise F15 


(a) Prove that the function f(x) = 1/z is differentiable at each point 
c € R— {0}, and determine f’(c). 
(b) Prove that the function 
x? cos(1/x), «#40, 
f(x) = 
0, t=0, 


is differentiable at 0, and determine f’(0). 


Hint: Use the Squeeze Rule for limits from Subsection 1.3 of Unit F1. 


FSS 
O 
x 


Prove that the function f(x) = |z| is not differentiable at 0. 
d) Prove that the function 


1/2 in(1/a), x 40, 
jos ft sin(I/x), 2 4 
0, xz =0, 


=~ 


is not differentiable at 0. 


We now establish the derivatives of some basic functions. 


Theorem F21 Basic derivatives 


(a) If f(z) =k, where k € R, then f'(x) = 0. 

(b) If f(z) = 2", where n EN, then f!(z) =nz"“!. 
(e) Ty (a) = sing, then / (7) = cosa: 

(d) If f(z) = cosa, then f'(x) = —sinz. 

e) Tifa) =e" them 7 (2) =e". 


Proof (a) If f(z) =k, then the difference quotient for f at any point 
is 0, so f'(x) =0. 
(b) The difference quotient for f at c is 
fle+h)— fle c+h)” —c” 
gua Let MLO etie 
h h 
@. Recall that, for a,b € R and n € N, 


(n 


— 1 
(atb) =a" yna ba aam + +". 


This is the Binomial Theorem, which you met in Subsection 3.4 of 
Unit D1 Numbers. & 
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So, by the Binomial Theorem, 


(n 


Q(h) = = G Hne Ih ++ dua on2y2 aes a | il e) 


h 

n(n — 1) 
2 

Thus Q(h) > nc”! as h —> 0, so f is differentiable at c for any c € R, 

and f'(x) = ngt. 


The difference quotient for f at c is 


= net + eT? h+- +h, where h £0. 


sin(c + h) — sinc 
h 


sinc cos h + cosc sinh — sin c 


h 
j -1 
= cos C (5#) + sinc (==) , where h £0. 


®. You met the limits 


sin h h-1 
mee 1 and = lim I eo 0 
h->0 h 


Q(h) = 


in Subsections 1.1 and 1.3 of Unit F1, respectively. © 
Thus 


Q(h) > cosc x 1+ sine x 0=cosc as h —> 0, 
so f'(x) = cosa, as required. 


The proof of this part is similar to that of part (c). We omit the 
details — you may like to write out the proof for yourself. 


The difference quotient for f at c is 


ecth ec 
Q(h) = 
(er -1 
=e (5 ye where h Æ 0. 
h 


= 1 
- = | in Subsection 1.3 of Unit F1. @ 


@ You saw that lim 
h—>0 

Thus 
Q(h) > e° x 1= e° as h —> 0, 


so f'(x) = e”, as required. E 


In general, when we differentiate a function f, we obtain a new function f’ 
whose domain may be smaller than that of f. The notion of 
differentiability can then be applied to the function f’, yielding another 
function whose domain consists of those points where f’ is differentiable. 


1 Differentiable functions 


Definitions 


Let f be differentiable on an open interval J, and let c € I. If the 
derivative f’ is differentiable at c, then we say that f is 

twice differentiable at c, and the number f”(c) = (f’)’(c) is called 
the second derivative of f at c. The function f”, also denoted 
by f©), is called the second derivative (or second derived 
function) of f. 


Similarly, we can define the higher-order derivatives of f, denoted 
by f® = f", f, and so on. 


Remarks 


1. You may meet many different ways of denoting the second derivative of 
a function. For example, f” is sometimes denoted by Df’ or D?(f). 
2 
In Leibniz notation f”(x) is written as a where y = f(x). 
a 
2. Some functions can be differentiated as many times as we like. For 
example, if f(x) = e”, then 


Fase, Pase, fO@)=e, 


However, not every derivative is differentiable at all points of its 
domain; you will see an example of this in Exercise F16. 


1.2 One-sided derivatives 


Although we know that the function f(x) = |x| is not differentiable at 0, 
the graph of f shown in Figure 5 suggests that chords which join the origin 
(0,0) to points (h, f(h)) have gradients equal to 1 if h > 0, and equal to 
—1 if h < 0. This example suggests the concept of a one-sided derivative. 


aY 


0 


Definitions 


; Figure 5 The graph of 
Let f be defined on an interval J, and let c € J. Then the f(x) = |z] 


left derivative of f at c is 


ma eS i Q(h), 


£—cT TEE h-0-— 


f(c) = 
provided that this limit exists. If the limit exists, we say that f is 
left differentiable at c. 

Similarly, the right derivative of f at c is 


x2—et T= € hot 


provided that this limit exists. If the limit exists, we say that f is 
right differentiable at c. 


67 


Unit F2 Differentiation 


Figure 6 ‘The Glue Rule for 
differentiable functions 


68 


Remarks 
1. The definition uses the concept of right and left limits which were 
defined in Subsection 1.4 of Unit F1. 


2. In some texts, a function f defined on a bounded closed interval J is 
defined to be differentiable on J if f has a derivative at each interior 
point of J and appropriate one-sided derivatives at each endpoint of I. 


The next theorem establishes the relationship between derivatives and 
one-sided derivatives. 


Theorem F22 
Let f be defined on an open interval J, and let c € I. 
(a) If f is differentiable at c, then f is both left differentiable and 
right differentiable at c, and 
file) = frlo) = f'(e). 
(b) If f is both left differentiable and right differentiable at c, and 
fi (©) = fr(c), then f is differentiable at c and 
f'(©) = filo) = fr(o)- 


Theorem F22 is closely related to the next theorem, the Glue Rule for 
differentiable functions, which is illustrated in Figure 6 and stated below. 
We omit the proofs of both these results as the ideas involved are very 
similar to those used in the proof of the Glue Rule for continuous functions 
(see Subsection 2.2 of Unit D4). 


Theorem F23 Glue Rule for differentiable functions 
Let f be defined on an open interval J, and let c € I. If there are 
functions g and h defined on J such that 
l go) = g) t m E rc 

me = hte), or a el, > G 
2, a = gle) = We) and 
3. g and h are differentiable at c, 
then f is differentiable at c if and only if g'(c) = h' (c). If f is 
differentiable at c, then f’(c) = g'(c) =h’'(c). 


The Glue Rule enables us to determine whether or not certain hybrid 
functions are differentiable at particular points, without using the 
definition of differentiability. The next worked exercise illustrates how to 
use the Glue Rule in this way. 


Worked Exercise F11 


Use the Glue Rule to prove that the function 


i, x <0, 
ro- 


sinz, x>0, 


is differentiable at 0, and determine f’(0). 


Solution 
®. The graph of f is shown below. 


YA 


Oj (sual ae 


Let J be the open interval R and define 
g(x) =x (ER) and A(xz)=sinz (« ER). 
Then f is defined on J and 0 € J. Also, 
E S= 9), for wv <0), 
Ie = Wr) for 7 S 
so condition 1 of the Glue Rule holds with c = 0. 


Furthermore, f(0) = g(0) = h(0) = 0, so condition 2 holds, and g and 
h are differentiable with 


giv) =1 and a = cosz, 
so condition 3 holds. 


Since g'(0) = 1=h'(0), we deduce that f is differentiable at 0, with 
f’(0) = 1, by the Glue Rule. 


If we want to prove that the function f in Worked Exercise F11 is 
differentiable at a point c other than 0, then we can use the fact that 
differentiability is a local property of the function; that is, it depends on 
the behaviour of the function in any open interval (no matter how short) 
containing c. Thus 


f'(x) — re a 1, T< 0, 


h'(x) = cosg, x> 0, 
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and hence, on combining this result with Worked Exercise F11, we have 
j 1, x <0, 
f(x) = 
cosa, «> 0. 


You can use this approach in the following exercise. 


Exercise F16 


Prove that the function 
2 
=p «<0, 
fa)= 7 
a, £20), 


is differentiable, and has derivative f'(x) = 2\z|. 


Since the function f'(x) = 2|2| is not differentiable at 0, Exercise F16 
shows that a derivative need not be differentiable at all points of its 
domain. 


Another consequence of the fact that differentiability is a local property is 
that the restriction of a differentiable function to an open subinterval of its 
domain gives a new differentiable function. For example, the function 


f(x) =a" (we (2,3)) 


is differentiable, since f(x) = x? is differentiable on R. 


1.3 Continuity and differentiability 


Next we discuss the relationship between continuity and differentiability. 
First we show that a differentiable function is continuous. 


Theorem F24 


Let f be defined on an open interval J, and let c € J. If f is 
differentiable at c, then f is continuous at c. 


Proof If f is differentiable at c, then 
on LO o 


in = Fe: 
For x € I and z Æ c, we have 
fle) — fo = EAO y æ- o) 


(x — c) 


Hence, by the Combination Rules for limits (see Subsection 1.3 of Unit F1), 
f(x) -f(d —> f(c) x0=0 as z> c. 


Thus f(x) > f(c) as z > c, so f is continuous at c. E 


The following corollary gives us a test for non-differentiability; it is simply 
the contrapositive of Theorem F24 (and so is equivalent to it). 


Corollary F25 


Let f be defined on an open interval J, and let c € J. If f is 
discontinuous at c, then f is not differentiable at c. 


For example, the function 


-1, «<0, 
kK(x)=4 0, x=0, 
1, x>0, 


is discontinuous at 0 because lim k(x) =1k(0). Thus k is not 
«0 


differentiable at 0, by Corollary F25. 


It is important to remember that a function can be continuous at a point 
but not differentiable at that point; for example, the modulus function is 
continuous at all points of R, but it is not differentiable at 0, as you saw in 
Exercise F15(c). This example can readily be modified to produce a 
continuous function which is not differentiable at any given finite set of 
points. It is even possible for a function to be continuous throughout its 
domain but nowhere differentiable, as we discuss below. 


A continuous nowhere-differentiable function 
(optional) 


The remainder of this section is not assessed, and is included only for your 
interest. 


In the nineteenth century, when the concepts of continuity and 
differentiability were first made precise, it was widely believed that if 
a function is continuous at all points of an interval, then it must be 
differentiable at most points of that interval. However, it turns out 
that there exist functions which are continuous everywhere but 
differentiable nowhere. The first example was found as early as 1834 
by Bernard Bolzano (1781-1848), but his pioneering work on analysis 
was not widely known. The first well-known example was constructed 
by Karl Weierstrass (1815-1897) in 1872. Such ‘pathological’ 
functions were regarded by some with suspicion. For example, the 
French mathematician Charles Hermite (1822-1901) wrote to a 
colleague in 1893, ‘I recoil in fear and loathing from that deplorable 
evil: continuous functions with no derivatives.’ However, in modern 
times it has been shown that such functions are in some sense normal 
and even useful. 
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In Subsection 3.2 of Unit F1 you met the blancmange function and saw 
that it is continuous at all points in its domain R. We now prove that this 
function is nowhere differentiable. Recall that the blancmange function B 


is defined as follows: 


Bie) => Z (2"2) = e(z) +4s(2r) + }s(40)+--» (©@€R), (2) 


The graph of the function s has a ‘corner’ at each point of the form k/2, 
where k € Z, so the graph of each function x ++ 2~"s(2"x) has a corner 
at each point of the form k/2"+!, where k € Z. This suggests that the 
graph of the function B, shown in Figure 7, does not have a tangent at any 
point of R, and we now prove that this is the case. 


YA 


Nie 
| 


0 


Figure 7 The blancmange function 


The name ‘blancmange function’ was used in the 1980s by the English 
mathematician David Tall, who remarked that B is nowhere 
differentiable because it wobbles too much! 


Theorem F26 


The blancmange function is not differentiable at any point c € R. 


To prove Theorem F26, we first prove three preliminary lemmas. 


As we will need to refer to the difference quotients of several functions, we 
adopt the following notation which shows the dependence on both the 
point c and the function f: 


flet+h) — fle) 


Qe g(h) = n ---o where h Æ 0. 
Lemma F27 
Let B be the blancmange function. Then for m = 1,2,..., we have 


B(x) = s(x) + 59(22) JF eee oe anni 9(2"" ia) + =a BQ"2), 


and the function 


S(t a) 


1 
xr — s(x) + 53 9(2@) + + el 


is linear on all intervals of the form [p2~™, (p + 1)27™], where p € Z. 


Proof ®. Recall that a function is linear if it has a rule of the form 
xt az + b for some a,b € R. & 


This follows directly from equation (2) and the definition of s. | 


Lemma F28 


Let B be the blancmange function. Then for each x € [0,1], there 
exist h, k ~ 0 such that 


g+h,c+ke [0,1] and |QzpB(h) — Qz.a(k)| > 1. 


Proof ®. The proof is illustrated in the following figure. 


1 
2 


Observe that the graph of y = B(x) is symmetrical about the line x = 5; 
and that B(x) > x for x € [0,5]. @ 
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We can assume by symmetry that 0 < x < Z. We now choose h so that 
z+h= 1- zx, and k so that z +k = 0 (or k = 4 if x = 0). Then 


B(x+h)—- B(x) B(1-—xzx)-— B(x) 


Qz,B(h) = h = = 0, 


by symmetry. Also, for x 4 0, 


Bia +k) -— Bla B(O) — B(x 
gapa 2+ = BG) _ BOO) — BE) 5, 
—2x 
because B(0) = 0 and B(x) > x for x € [0,5]. A similar argument shows 
that the same result holds for x = 0. Thus the inequality in the statement 
of the lemma follows, as required. E 


an 


Lemma F29 


Given any real function f and a linear function g(x) = ax + b, the 
corresponding difference quotients of the functions f and f + g always 
differ by a, the gradient of the linear function g. 


Proof This holds because 


Qe f+g(h) — Qep(h) = Qeg lh) = (gle + h) — g(c))/h = a, 
for any c € R and h #0. E 


We now use these three lemmas to prove that the blancmange function is 
nowhere differentiable. 


Proof of Theorem F26 Let ce R and choose integers pm, 
m = 0,1,2,..., such that c € Im, where 


Im = [Pm ™, (Pm + Le. 
®. Note that (Im) is a ‘nested’ sequence of closed intervals, each of which 
contains the point c. © 


Now it follows from Lemma F27 that, on the interval Im, the function B is 
the sum of a linear function and the function B,,(x) = 2~™ B(2™z), which 
is obtained from B by scaling in both the x- and y-directions by the 

factor 27”. 


@®. The graph of Bm on the interval Im is a scaled-down copy of the graph 
of B on [0,1] (see Figure 8): it is a mini-blancmange! .@ 


Difference quotients are unchanged by such a scaling, so Lemma F28 
implies that there exist hin, km 4 0 such that 


cthm,ct+tkm€Im and |QeB,,(hm) — Qe,Bm(km)| 2 1. 


Therefore, by Lemma F29, 
IQc,B(Rm) — Qe,B(km)| = |Qe,Bm (Am) — Qe,Bm(km)| 2 1. (3) 


®. This holds because the difference quotients of the linear part of the 
function B cancel out. © 


Now since c, ct hm, c+ km E Im, for m=1,2,..., and Im has 
length 2~™, we have hm + 0 and km > 0. Thus if B is differentiable at c, 
then it follows from the definition of differentiability that 


Qc,B(hm) > B'(c) and Qe,B(km) > B’(c) as m > oo. 


But this contradicts inequality (3), so B is not differentiable at c. | 


Finally we note that, since the function B is continuous, it follows from the 
Extreme Value Theorem (see Subsection 3.3 of Unit D4) that it must have 
a maximum and a minimum on the interval [0,1]. The minimum is 

B(0) = B(1) =0, but the maximum is not so clear. In fact, it can be 
shown that the maximum is B(%) = 2, and that this value is taken at 


infinitely many points in [0, 1]; a strange function indeed! 


2 Rules for differentiation 


In Section 1 you saw how to show that various basic functions are 
differentiable on R, by working directly with the definition of a 
differentiable function. In this section you will see that we can often show 
that a function is differentiable by using the Combination Rules, the 
Composition Rule (also called the Chain Rule) and the Inverse Function 
Rule for differentiable functions. 


We will use these rules to determine the derivatives of many more 
functions. The most important of these (together with the basic functions 
from Section 1) are summarised in a table of standard derivatives, which 
can be found at the end of this unit and in the module Handbook. 


2.1 Combination Rules 


The Combination Rules for differentiable functions are a consequence of 
the Combination Rules for limits that you met in Subsection 1.3 of 
Unit F1. 
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Theorem F30 Combination Rules for differentiable 
functions 

Let f and g be defined on an open interval J, and let c € I. If f and g 

are differentiable at c, then so are the following functions. 


Sum Rule f +g, with derivative 
(f+9)'(o) = fie) +9") 
Multiple Rule Af, for A € R, with derivative 
(AF) Ce) = Af’ (c) 
Product Rule fg, with derivative 
(Fg) (c) = f'(c)g(e) + Flog’) 
Quotient Rule f/g, provided that g(c) Æ 0, with derivative 
(£) o IF) = Fogle) 
g (g(e))? 


Each of these rules can be expressed in any of the alternative notations 
commonly used for derivatives. For example, in Leibniz notation, the 
Product Rule becomes 
dy du du 
if y = uv, then -+ = —v + u—. 
á f dx dz i dx 


Similarly, the Quotient Rule is written 


. dy 1 du dv 
if y = u/v, then oa (= a) 


Proof of Theorem F30 ®. To prove each rule, we denote the function 
that we wish to show is differentiable by F, and express the difference 
quotient for F in terms of the difference quotients for f and g. ® 
Suppose that f and g are both differentiable at c. 


Sum Rule Let F = f +g. Then 
F(x) —F(c) _ (F(z) + g(z)) - (F) + g(0)) 


r—-c 7 Z-—C 
afera , a2) - 99) 
L-C L-C 
> f'(e)+g'(c) as x > c, 
by the Sum Rule for limits, since f and g are differentiable at c. Thus F is 
differentiable at c, with derivative 


F' (c) = f'(c) + g' (0). 
Multiple Rule This is a special case of the Product Rule, with g(x) = A. 


Product Rule Let F = fg. Then 
F(x)- Fle) _ flx)g(x) — fle)g(e) 


= 2e -4O gia) fi =O 


> fi(e)g(e) + f(c)g'(c) as z > c, 
by the Combination Rules for limits, since f and g are differentiable at c, 


and g is continuous at c by Theorem F24, so that g(x) > g(c) as z > c. 


Thus F is differentiable at c, with derivative 


F'(c) = f'(o)a(e) + f(g‘) 
Quotient Rule ®. We first use the ¢-6 definition of continuity that you 
met in Subsection 3.1 of Unit F1 to show that, since g(c) 4 0 and g is 
continuous at c, then g must be non-zero on an open interval containing c. 
To show this, we take e = $|g(c)| in the definition. @ 


Let F = f/g. Since g is continuous at c and g(c) 4 0, there exists ô > 0 
such that J = (c—6,c+6) C I and 


lg(x) — g(c)| < żlg(c)|, for all x with |x — c| < 6. 


In particular, this shows that g(x) 4 0 for x € J, so the domain of F 
contains J. Then, for x € J, 


F(a) - FO -> (4-4) 


r—c = æ—ce\glz) gle 


_ ( 
) 


glc) F'(c) — Fg'(©) 


> SS aa te, 


(g(c))? 


by the Combination Rules for limits, since f and g are differentiable at c, 
and g is continuous at c. 


Thus F is differentiable at c, with derivative 


ra goro- 
i er o u 


Since, by Theorem F21(b), the derivative of f(x) = x” is f'(x) = nz”! for 
any n € N, it follows from the Combination Rules that any polynomial 
function is differentiable on R and that its derivative can be obtained by 
differentiating the polynomial term by term. We state this result as a 
corollary of the Combination Rules. 
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Corollary F31 
Let 


p(x) = ao Je ana A aa Ae o a o di een” (x € R), 
where ag, @1,.--,@, E€ R. Then p is differentiable on R, with derivative 


p(x) =a, + 2agr+-+++nanz™ (TER) 


Furthermore, since a rational function is a quotient of two polynomials, it 
follows from Corollary F31 and from the Quotient Rule that a rational 
function is differentiable at all points where its denominator is non-zero, 
that is, at all points of the domain of the function. Here is an example. 


Worked Exercise F12 


Prove that the function 


f@)=z— (@eER-{-11} 


is differentiable on its domain, and find its derivative. 


The Quotient Rule can also be used to show that the formula for 
differentiating f(x) = x”, given in Theorem F21(b), remains valid if n is a 
negative integer. 


We now give several exercises. It may not be necessary for you to work 
through all these exercises if you are confident that you can use the 
Combination Rules. 


Exercise F17 


Find the derivative of each of the following functions. 
(a) f(x) =x" — 2x4 + 3x3 —5x+1 (ER) 
xr? +1 
-Z (œeRr-{1}) 
(c) f(x) =sinxcosx (æ €R) 


(da) f(z) = 


e% 


— ER 
3 + sing — 2 cosx (a ) 


Exercise F18 


Find the third derivative of the function 
f(z) =aze” (xeR). 


In Section 1 we differentiated the functions sin, cos and exp. We now ask 
you to use these basic derivatives and the Combination Rules to find the 
derivatives of the other trigonometric functions and the three most 
common hyperbolic functions. The derivatives of these functions are 
included in the table of standard derivatives given at the end of this unit 
and in the module Handbook. 


Exercise F19 


Find the derivative of each of the following functions. 
(a) f(v)=tanz (x# (k+ 4yr, kez) 

(b) f(x) =cosecar (x # kr, k€ Z) 

(c) f(a) =seca (a A(k+5)n, kez) 

(da) f(x)=cotx (x # kr, k€ zZ) 


Exercise F20 


Find the derivative of each of the following functions. 
(a) f(x)=sinhx (x €R) 
(b) f(x)=coshx (x €R) 
(c) f(x)=tanhx (xE€R) 
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2.2 Composition Rule 


In Subsection 2.1 we extended our stock of differentiable functions to 
include all polynomial, rational, trigonometric and hyperbolic functions. 
We also need to be able to differentiate functions such as 


f(x) =sin(cosz) (x €R), 


which is the composite of the two differentiable functions sin and cos. 
To do this, we use the following Composition Rule for differentiable 
functions, which is commonly known as the Chain Rule. 


Theorem F32 Composition Rule for differentiable 
functions (the Chain Rule) 

Let f be defined on an open interval J, let g be defined on an open 

interval J such that f(I) C J, and let c € T. 


If f is differentiable at c and g is differentiable at f(c), then go f is 
differentiable at c and 


(go F) (e) =9 (FO) F (c). 


Remarks 

1. The Composition Rule tells us that ‘a differentiable function of a 
differentiable function is differentiable’, and gives us a formula for its 
derivative. 


2. When written in Leibniz notation, the Composition Rule has a form 
that is easy to remember: if we put 
u= f(x) and y= g(u) = g(f(x)), 
then 
dy dy du 


d£ du dx’ 


3. The Composition Rule can be extended to a composite of three or more 


functions; for example, 
(hogo f)(x) = K (gf) (FE) f E). 
In Leibniz notation, if we put 
v= f(x) u=g(v) and y= h(u)= h(g(f(x))), 
then 
dy dy du P dv 
dx du dv” dx 
We often use this extended form of the Composition Rule without 
mentioning it explicitly. 


Proof of Theorem F32 Let F =go f. The difference quotient for F 
at c is 


F(x) —F(o) _ 9(f(@)) — 9(F(0) (4) 
L—C L—C 

Let y = f(x), where x € I, and let d = f(c). Then the right-hand side of 
equation (4) can be written 

(“2 — 72) (2 — fo) , provided that y Æ d. (5) 

y—d z-e 

To avoid the difficulty that expression (5) is undefined if y = d, which can 
occur in some situations, we introduce the function 


g(y) — g(d) 
hy)=} yaa 7 PTA 
g'(d), y=d. 


Since g is differentiable at d, 
h(y) > g'(d) as y > d; 
and since h(d) = g'(d), it follows that h is continuous at d. 


By the Composition Rule for continuous functions (see Subsection 2.2 of 
Unit D4), and recalling that y = f(x) and d = f(c), we deduce that 


D-O y, 
(hof\(a)=¢ Fæ- A 
is continuous at c. 
Next, note that if f(x) 4 f(c), then equation (4) and expression (5) give 
F(a) - Flo) _ f(x) = f(e) 
pom ea (2). (6) 
Equation (6) is also true when f(x) = f(c), since both sides are then 0. 


If we now let x tend to c in equation (6) and use the continuity at c of the 
function ho f, then we obtain 


Pa) FO, INSO asee 


Thus F is differentiable at c, with derivative 


Fo =I (FO) f(c). = 
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If you are interested in why we need to allow for the possibility that y = d 
in the above proof, here is an example of a situation where this arises. 
Suppose that f is the function 
x*cos(1/x), «#40, 
f(x) = 
0, y= Q; 
which is differentiable at 0, as you showed in Exercise F15(b). Then if 
c = 0 we have d = f(c) = 0, and since cos(n + 5)t = 0 for n EN, it follows 


that y = f(x) takes the value 0 infinitely many times in any interval 
containing 0. 


Worked Exercise F13 


Prove that each of the following composite functions is differentiable on its 
domain, and find its derivative. 


(a) k(x) =sin(cosx) (x €R) 
(b) k(x) =cosh(e2”) (x € R) 
(c) k(x) = tan(x?) (a € (-1,1)) 
Solution 
(a) Here k(x) = sin(cos x), so let 
(2) = cose and) o(7)—sing (77 
Then f and g are differentiable on R, and 
fi(z)=—sing and g/(z)=cosx (x ER). 
®. Here we write out all the steps in the application of the 


Composition Rule, but it is not necessary for you to do this in 
your solutions. © 


By the Composition Rule, k = go f is differentiable on R, and 
ki(x) = g'(f(x)) f (2) 
= cos(cos x) x (— sin x) 
= — cos(cos x) sin x. 
(b) Here k(x) = cosh(e?*), so let 
o S= “g(a)—e and h(x) —cosha (eR). 
Then f, g and h are differentiable on R, and 
rel= de= and W@)=sinhz We). 
®. Here we use the extended form of the Composition Rule. ® 
By the Composition Rule, k = ho g o f is differentiable on R, and 
k'(x) = h'(g(f(2))) (F(E) F (2) 
= sine) ae x2 


= 2e” sinhe”) 
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Exercise F21 


Find the derivative of each of the following functions. 
(a) f(x) =sinh(z?) (x € R) 
(b) f(x) =sin(sinh2z) (# € R) 


cos 2x 


O fle) =sin( SS") (e e (0,00) 


2.3 Inverse Function Rule 


In Section 4 of Unit D4 we discussed inverse functions and proved that if a 
function f with domain an interval J and image set J = f(J) is strictly 
monotonic and continuous on J, then J is an interval, and f possesses a 
strictly monotonic and continuous inverse function f~! with domain J. 
(Recall that strictly monotonic means that f is either strictly increasing or 
strictly decreasing.) In particular, we showed that the power functions, the 
trigonometric functions, the exponential function and the hyperbolic 
functions all have continuous inverse functions, provided that we restrict 
their domains where necessary. 


These standard functions are all differentiable on their domains, and we 
now investigate whether their inverse functions also have this property. 
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Figure 9 The tangent at a 
point on the graph y = f(x) 
and its reflection in the line 
=z 


Figure 10 The graph of f in 
Theorem F33 


Figure 11 The graph of f~! 
in Theorem F33 
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It is instructive to begin by considering their graphs. Recall that we obtain 
the graph y = f—!(a) by reflecting the graph y = f(x) in the line y = z, 
which maps a typical point A(c,d) on the graph y = f(x) to the point 
B(d,c) on the graph y = f~!(z), as illustrated in Figure 9. This suggests 
that if the gradient of the tangent to the graph y = f(x) at the point A is 
f'(c) =m, then the gradient of the tangent to the graph y = f~'(x) at B 
is (ft) (d) =1/m, provided that m 4 0. However, if the graph of f has a 
horizontal tangent (m = 0) at a point A, then the graph of f~! has a 
vertical tangent at the corresponding point B; in this case, f~! is not 
differentiable at B, since 1/m is not defined for m = 0. We therefore need 
the condition ‘ f'(x) is non-zero’ in our statement of the rule for 
differentiating inverse functions. The Inverse Function Rule for 
differentiable functions is formally stated as the next theorem and 
illustrated in Figures 10 and 11. 


Theorem F33 Inverse Function Rule for differentiable 
functions 
Let f be a function whose domain is an open interval J on which f is 


continuous and strictly monotonic. Then f has an inverse function 
fo) wi domam J = j(i). 


If f is differentiable on J and f'(x) 4 0 for x € I, then f~! is 
differentiable on J. Also, if c € I and d = f(c), then 


The Leibniz notation for derivatives can be used to express the Inverse 
Function Rule in a form that is easy to remember: if we put 


y=f(x) and «= f(y), 


and write 
dy j dx -1) 
re for f'(x) and W for (f~) (y), 


then 
dz 1 
dy dy/dz` 


Proof of Theorem F33 First note that f has an inverse function f—! 
with domain J = f(I) by the Inverse Function Theorem for continuous 
functions, proved in Subsection 4.1 of Unit D4. 

Let y € J and y 4d, so f-'(y) = z, where x € I and x £ c (since f is 
strictly monotonic). 


Then the difference quotient for f~! at d is 
ware ae 


y—d Te- FO 
_, /f@=t0 


As y > d, we have z = f~!(y) — c, since f~! is continuous. So 
ft@M-f7°@ _, /f@-fO 
= y-zd pa 
> 1/f'(c) asyd (since f’(c) £0). 


Thus f~t is differentiable at d, with derivative (f~1)/(d) = 1/f'(c). So f7! 
is differentiable on J. a 


The next worked exercise shows how the Inverse Function Rule can be 
used to determine the derivative of the inverse for some standard 
functions. (The derivatives of these inverse functions are included in the 
table of standard derivatives given at the end of this unit and in the 
module Handbook.) 


Worked Exercise F14 


For each of the following functions f, state the domain and rule of f~!, 
show that f~t is differentiable and determine its derivative. 


(a) f(z) =a" (2 ER), wherenEN,n>2 
(b) f(x) =tanaz (a € (—7/2,7/2)) 
(c) fl) =e" (eR) 
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(b) The function 
f(z) =tanz (a € (—7/2,7/2)) 


is continuous and strictly increasing, and f ((—7/2,7/2)) =R. 
Also, f is differentiable on (—1/2, 7/2), and its derivative 
f'(x) = sec? x is non-zero there. So f satisfies the conditions of 
the Inverse Function Rule. 


Hence f has an inverse function f~! and f—~!(y) = tan™! y is 
differentiable on its domain R. If y = f(x) = tan x, then 


il 1 1 1 


=l ! = — - SO = r = 3 
(Fu) = f'(x) seca 1+tan?x 1+4? 
Replacing the domain variable y by x, we obtain 
1 
= 
lan a= Er (c € R). 


(c) The function 
f(z) =e" (ceER) 


is continuous and strictly increasing, and f(R) = (0,00). Also, f 
is differentiable on R, and its derivative f'(x) = e” is non-zero 
there. So f satisfies the conditions of the Inverse Function Rule. 


Hence f has an inverse function f—' (which we call log) and 
f-'(y) = logy is differentiable on its domain (0, oo). If 
m= j (eo) —e> then 


Wigs ===. 


Replacing the domain variable y by x, we obtain 


(1og)'(2) == (w € (0,00) 


Exercise F22 


For each of the following functions f, show that f~! is differentiable and 
determine its derivative. 


(a) f(x) =cosx (ax € (0,7)) (b) f(z) =sinha (x €R) 


The Inverse Function Rule can sometimes be used to find values of the 
derivative of an inverse function f~' even when the equation y = f(x) 
cannot be solved to give a formula for the rule of f~!. The next exercise 
illustrates this point. 


Exercise F23 


Let f(x) = x5 +x-—1 (x €R). 
(a) Prove that f has an inverse function f7} which is differentiable on R. 


(You may assume here that f(R) = R; this was proved in Worked 
Exercise D54 in Unit D4.) 


(b) Find the values of (f~!)/(d) at those points d = f(c) where 
c=0,1,-1. 


Exponential functions 
In Subsection 4.3 of Unit D4 we defined the number a”, for a > 0, by the 
formula 

a” = exp(z log a). 


Since the functions exp and log are differentiable on R and R*Ħ, respectively 
(by Theorem F21(e) and Worked Exercise F14(c)), it follows that we can 
use this formula to determine the derivatives of several related functions. 
The functions in the next two worked exercises are included in the table of 
standard derivatives. Notice that the derivative in Worked Exercise F15 
agrees with the formula for the derivative of f(x) = x”, where n € N. 


Worked Exercise F15 


Prove that, for a € R, the function 
f(z)=2° (x eR") 


is differentiable on its domain, and that 


fi'(e) =ar! (2 €R*). 
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YA 
y= 2" 
1 
x 
Figure 12 The graph of 
f(z) = 2" 
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Worked Exercise F16 


Prove that, for a > 0, the function 
f(x)=a" (weER) 

is differentiable, and that 
f'(z) =a*loga (x €R). 


At the beginning of Book D we posed the following question: does the 
graph of the function f(x) = 2”, shown in Figure 12, have a jump at the 
point V2? We showed in Subsection 4.3 of Unit D4 that f is continuous, 
which proves that its graph has no jumps. In Worked Exercise F16 we 
have now shown that f also is differentiable, so its graph has a tangent at 
every point and thus has no corners. 


Exercise F24 


Prove that the function 
f(z) =27 (xeR?) 


is differentiable, and find its derivative. 


3 Rolle’s Theorem 


In this section and in Section 4 you will meet some of the fundamental 
properties of functions that are differentiable not just at a particular point, 
but on an interval. These results are motivated by the geometric 
significance of differentiability in terms of tangents, and they explain why 
the graphs of differentiable functions possess certain geometric properties. 


3.1 Local Extreme Value Theorem 


In Section 3 of Unit D4 you met some of the fundamental properties of 
functions which are continuous on a bounded closed interval. In particular, 
you studied the Extreme Value Theorem, which states that if a function f 
is continuous on a closed interval [a,b], then there are points c and d in 

(a, b] such that 


f(x) < f(d), for x € [a,b], 
and 
f(z) > f(c), for x € [a,b]. 


This is illustrated in Figure 13. The value f(d) is the maximum of f 

on [a,b], and the value f(c) is the minimum of f on [a,b]. A maximum or 
a minimum of f is called an extreme value of f. But how do we 
determine the points c and d where these extreme values occur? In 
general, this is not easy. However, if the function f is differentiable, then 
we can, in principle, determine c and d by first finding any local extreme 
values of the function f on the interval [a, b]. 


Roughly speaking, for a point c in (a,b), the value f(c) is a local maximum 
of f on [a,b] if f(c) is the greatest value of f in the immediate vicinity of c, 
and a local minimum of f on [a, 6] if f(c) is the least value of f in the 
immediate vicinity of c. These ideas are illustrated in Figure 14 and stated 
formally in the definitions which follow. 


YA 
i 
i y = f(x) 
ae 


Figure 14 The local extreme values of a function 
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Figure 13 The extreme 


values of a function continuous 


on a closed interval 
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f(a) = 2° 


Figure 16 ‘The proof of the 
Local Extreme Value Theorem 
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Definitions 
The function f has 


e a local maximum f(c) at c if there is an open interval 
I=(c—r,c+r), where r > 0, in the domain of f such that 


je) fle owa E 


e a local minimum f(c) at c if there is an open interval 
I = (c—r,c +r), where r > 0, in the domain of f such that 


Maz reo owe ae 


e a local extreme value f(c) at c if f(c) is either a local maximum 
or a local minimum. 


By definition, a local extreme value of a function f defined on a bounded 
closed interval [a,b] is an interior point of [a,b]; that is, a local extreme 
value cannot occur at either of the endpoints a and b. 


If we want to find the local extreme values of a differentiable function f, 
then we can use the following result, which gives a connection between the 
local extreme values of a function f and the points c where f’(c) =0. A 
point c such that f’(c) = 0 is called a stationary point of f; we 
sometimes say that f’ vanishes at c. 


Theorem F34 Local Extreme Value Theorem 


If f has a local extreme value at c and f is differentiable at c, then 


O=; 


Note that the converse of the Local Extreme Value Theorem is false: a 
point where the derivative vanishes is not necessarily a local extreme value. 
For example, the function f(a) = x? does not have a local extreme value 
at 0, although f’(0) = 0; see Figure 15. 


Proof of Theorem F34 We prove the result only for a local maximum; 
the proof of the local minimum version is similar. Suppose that f has a 
local maximum at c. Then there exists a positive number r such that 


f(x) < flo, fore-r<a<cHr. (7) 
Now let 
T , r 
In =c+— and Z,=c-——-, n=2,3,.... 
n n 


®. We do not include n = 1 as we require £n and z’, to lie in the open 
interval (c — r,c + r). The argument which follows is illustrated in 
Figure 16. @ 


Then c < £n < c +r, for n = 2,3,..., so 

f(£n)— f() <0 and a,—c>0, forn =2,3,..., 
by inequality (7). Hence 

F(2n) - Fo) 


In — cC 


<0, forn=2,3,.... 


Since £n > c, we deduce by the Limit Inequality Rule (see Subsection 3.3 
of Unit D2 Sequences) that 


f'(c) = lim f(r) = Flo) <0. (8) 
n— oo y= 
On the other hand, c — r < 2}, < c, for n = 2,3,..., so 


f(c) — F(A <0 and zh —c<0, forn=2,8,..., 
by inequality (7). Hence 
f 
CAERON 


7 , forn=2,3,.... 
z — ce 


Since x}, > c, we deduce that 


n lth) = £0 


t 
> 
f(e) lim tce ~ 0. (9) 
Hence, by inequalities (8) and (9), we have f’(c) = 0, as required. a 


Any extreme value of a function f on a bounded closed interval [a,b] which 
is not f(a) or f(b) must also be a local extreme value. Thus, by 

Theorem F34, such a point x must satisfy f'(x) = 0. This gives the 
following property of the extreme values of a differentiable function on a 
bounded closed interval. 


Corollary F35 


Let f be continuous on the closed interval [a,b] and differentiable on 
(a,b). Then the extreme values of f on [a,b] can occur only at a or b, 
or at points x in (a,b) where f'(x) = 0. 


We now reformulate Corollary F35 as a strategy for locating maxima and 
minima. 
Strategy F6 


To find the maximum and minimum of a function f that is continuous 
on [a,b] and differentiable on (a,b), do the following. 


1. Determine the points c1,c2,... in (a,b) where f’ is zero. 


2. Hence determine the values of 


f(a), fb), Per) Ia) e3 


the greatest of these is the maximum and the least is the minimum. 
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Note that, in some cases, there may be infinitely many points in (a, b) 
where f’ is zero; this is so, for example, if f is constant on (a,b). 


Exercise F25 


Use Strategy F6 to determine the maximum and minimum of the function 
f(x) = sin? x + cos z 


on the interval [0, 7/2]. 


3.2 Rolle’s Theorem 


In the last subsection you saw that if a function f is continuous on the 

closed interval [a,b] and differentiable on the open interval (a,b), then the 

extreme values of f can occur only at a or b, or at some point c in (a,b) 

such that f’(c) = 0. If we also know that the values of f(a) and f(b) are 
ya equal, then we can deduce that there must be some point c in (a,b) where 
f'(c) = 0. This is illustrated in Figure 17. 


Theorem F30 Rolle’s Theorem 


Let f be continuous on the closed interval [a,b] and differentiable 


l 

I 

| 

| n (a,b). If f(a) = f(b), then there exists a point c, witha < c < b, 
a é b T such that 


Figure 17 A function with O=. 
f(a) = f(b) 
Remarks 


1. Rolle’s Theorem is one of the most important theorems in analysis; for 


example, most of the results you will meet in Sections 4 and 5 depend 
on Rolle’s Theorem. 


2. Rolle’s Theorem is an existence theorem; that is, it tells us that a 
point c with the stated property exists, but not how to find it. Often it 
is difficult to evaluate c explicitly, and there may be more than one 
point c in (a,b) at which f’ vanishes. 


3. In geometric terms, Rolle’s Theorem states that if the line joining the 


points (a, f(a)), (b, f(b)) on the graph of f is horizontal, then so is the 
tangent to the graph for some c € (a,b). 
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3 Rolle’s Theorem 


Rolle’s Theorem is one of the foundational theorems in differential 
calculus. Its importance lies in the fact that it is needed in the proof 
of the Mean Value Theorem and for establishing the existence of 
Taylor series. However, when Michel Rolle (1652-1719) made the first 
statement of the theorem in 1690, Taylor series had not yet been 
discovered and calculus was still in its infancy. Moreover, Rolle was 
deeply suspicious of its methods. The first appearance of his theorem 
was not in the context of calculus at all, but in the context of solving 
equations. It is paradoxical that the name of a man renowned for his 
opposition to the infinitesimal calculus should end up attached to one 
of the fundamental theorems in the subject. 


During the eighteenth century the theorem lived on as a theorem in 
algebra until it appeared in the work of Leonhard Euler (1707-1783) 
in 1755. Euler, with the calculus at his fingertips, had no need for 
Rolle’s rather convoluted algebraic method and so for the first time 
the theorem resembled its modern counterpart. It is not known 
whether Euler had read Rolle’s work since, characteristically, he made 
no reference to any earlier work. By the 1830s the theorem had 
appeared in a number of textbooks on the theory of equations, but 
was still not associated with Rolle. It was first ascribed to Rolle by 
Wilhelm Drobisch (1802-1896), a professor at the University of 
Leipzig, in a textbook of 1834. 


In the latter half of the nineteenth century the theorem underwent its 
second significant change. From being a useful result in the theory of 
equations it was transformed into a fundamental theorem in analysis. 
In 1873 Charles Hermite (1822-1901) used the theorem in his Cours 
d’Analyse in the context of the theory of Taylor series, clearly 
attributing it to Rolle. Hermite was the leading French analyst of his 
generation and his Cours d’Analyse was extremely influential in 
France during the latter part of the nineteenth century. His Charles Hermite 
unequivocal association of the theorem with Rolle was decisive for 

future writers. 


Proof of Theorem F36 Suppose that f(a) = f(b). 


If f is constant on [a,b], then f'(x) = 0 everywhere on (a,b); in this case, 
we can take c to be any point of (a,b). 


If f is non-constant on [a,b], then either the maximum or the minimum (or 
both) of f on [a,b] is different from the common value f(a) = f(b). 


®. Since f is continuous on [a,b], it must have both a maximum and a 
minimum on [a,b], by the Extreme Value Theorem; see Subsection 3.3 of 
Unit D4. & 


Since one of the extreme values occurs at some point c with a < c < b, the 
Local Extreme Value Theorem shows that f’(c) must be zero. a 
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We can use Rolle’s Theorem to verify the existence of zeros of certain 
functions which are derivatives. 


Worked Exercise F17 


Use Rolle’s Theorem to show that if 
fla) = 324 — 2r? = 227 + 2x, 


then there is a value of c in (—1,1) such that f’(c) = 0. 


For the function in Worked Exercise F17 we can find a value for c directly 
by using the fact that 


f'(x) = 122° — 62? — 4r + 2 
= 2(3x° — 1) (2x — 1). 


Thus f’ has a zero at each of the points —1/v3, 1/V3 and 4, which are all 
in (—1,1). 


Exercise F20 


Use Rolle’s Theorem to show that if 
f(x) = x4 — 4r? + 3a? +2, 
then there is a value of c in (1,3) such that f’(c) = 0. 


Exercise F27 


For each of the following functions, state whether Rolle’s Theorem applies 
for the given interval. 


(a) F@)=tang, [0,7] 
b) f(x) =3|@—-1]—2, [0,3] 
c) f(z) =2-927 +828, [0,1] 


( 
( 
(d) f(z) =sinz+tan“!2, [0,7/2] 


4 Mean Value Theorem 


4 Mean Value Theorem 


In this section you will continue to study the geometric properties of 
functions that are differentiable on intervals and meet some of their 
applications. 


4.1 Mean Value Theorem 


First we recall the geometric interpretation of Rolle’s Theorem from the 
previous section. Rolle’s Theorem tells us that, under suitable conditions, 
if the chord joining the points (a, f(a)) and (b, f(b)) on the graph of f is 
horizontal, then so is the tangent to the graph for some c in (a,b). 


xY 


If you imagine pushing this horizontal chord, always parallel to its original 
position, until it is just about to lose contact with the graph of f, then it 
appears that at this point the chord is a tangent to the graph; see the top 
graph in Figure 18. This ‘chord-pushing’ approach suggests that even if 
the original chord is not horizontal (that is, if f(a) # f(b)), then there 
must still be a point c in (a,b) at which the tangent is parallel to the 
chord; see the bottom graph in Figure 18. 


RY 


Figure 18 A tangent parallel 
Worked Exercise F18 to a chord 


Consider the function 


f(x) =2° —324+3 (re [1,2]). 


Find a point c € (1,2) such that the tangent to the graph of f is parallel to 
the chord joining (1, f(1)) to (2, f(2)). 
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di y = f(x) 
Jos 
| hlz 
O 


Figure 19 The vertical 
distance h(x) 
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Now f is a polynomial, so it is differentiable on (1,2), and its 
derivative is f’(x) = 3x? — 3. Hence f(c) = 4 when 3c? = 7; that is, 
when e= y 1/3 = 153: 


Thus at the point (c, f(c)) the tangent to the graph is parallel to the 
chord joining the endpoints of the graph. 


We now generalise Rolle’s Theorem and show that there is always a point 
where the tangent to the graph is parallel to the chord joining the 
endpoints. This result is known as the Mean Value Theorem, so-called 
since the gradient of the chord, 


f(b) — f(a) 
b-a ’ 


can be thought of as the mean value of the derivative between a and b. 


Theorem F37 Mean Value Theorem 


Let f be continuous on the closed interval [a,b] and differentiable 
on (a,b). Then there exists a point c in (a,b) such that 


a 


Remarks 


1. Like Rolle’s Theorem, this result is an existence theorem: it does not 
tell us what the point c is, just that such a point exists. 


2. When f(a) = f(b), the Mean Value Theorem reduces to Rolle’s 


Theorem. 


Proof of Theorem F37 The gradient of the chord joining the points 
(a, f(a)) and (b, f(b)) is 
_ £0)= fa) 
b-a ’ 
so the equation of the chord is 
y = m(x — a) + f(a). 


It follows that, for x € [a,b], the vertical distance h(x) from the point 
(x, f(x)) to the chord, as shown in Figure 19, is given by the function 


h(x) = f(x) — (m(x — a) + f(a). 


Now h(a) = h(b) = 0, and A is continuous on [a,b] and differentiable on 
(a,b). Thus h satisfies all the conditions of Rolle’s Theorem. 


It follows that there exists a point c in (a,b) such that h’(c) = 0. Since 
h'(c) = f'(c) — m, we have 


f(b) = f(a) 
b-a ’ 


as required. | 


fie) =m= 


Worked Exercise F19 
Use the Mean Value Theorem to show that if 


z—1 
= ae 


then there is a point c in (1, 7) such that f’(c) = 2. 


For the function in Worked Exercise F19 we can find a value for c directly. 
Since f'(x) = 2/(x + 1)?, the point c satisfies 
2 2 . 2 
f'(o) = (e+ 1? = g? that 15, (c+ 1) =9. 
This equation has solutions 2 and —4, so c = 2 (since 2 € (1, 4)). 


In the following exercise, a value for c cannot be found in this direct way. 


Exercise F28 


Use the Mean Value Theorem to show that if 
f(x) = ze", 
then there is a point c in (0,2) such that f’(c) = e. 
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4.2 Positive, negative and zero derivatives 


We now study some consequences of the Mean Value Theorem for 
functions whose derivatives are always positive, always negative, or always 
zero. First we prove a fundamental result about monotonic functions, 
which you used to help with graph sketching in Section 2 of Unit A4 Real 
functions, graphs and conics. 


For any interval J, the interior of J is the largest open subinterval of J. It 
is obtained from I by removing any endpoints of J, so it consists of all the 
interior points of T. 


Theorem F38 _ Increasing—Decreasing Theorem 


Let f be continuous on an interval J and differentiable on the interior 
of I. 
(a) If f'(x) > 0 for x in the interior of J, then f is increasing on J. 


(b) If f'(x) < 0 for x in the interior of J, then f is decreasing on T. 


Proof Choose any two points xı and xə in I, with zı < x2. The 
function f satisfies the conditions of the Mean Value Theorem on the 
interval [x1, x2], so there exists a point c in (x1, x2) such that 


f (x2) = f (21) = f'(c). 
T2 — Tı 
Hence f(x2)— f(x1) has the same sign as f'(c), so 
(a) if f'(x) > 0 for z in the interior of J, then f(r2) — f(a1) > 0 and 
hence f is increasing on I 


(b) if f'(x) <0 for x in the interior of J, then f(#2) — f(x1) < 0 and 
hence f is decreasing on T. | 


There is also a version of Theorem F38 with the weak inequalities replaced 
by strict inequalities, in which case the conclusions are as follows. 


(a) If f'(x) > 0 for x in the interior of J, then f is strictly increasing on T. 
(b) If f’(x) <0 for x in the interior of J, then f is strictly decreasing on I. 


Note that the converse of each of these two statements with strict 
inequalities is false. For example, f(x) = x° is strictly increasing on 
[—1,1], but f’(0) =0. 


Exercise F29 


On the given interval J, determine whether each of the following 
functions f is 


e strictly increasing 

e increasing, but not strictly increasing 
e strictly decreasing, or 

e decreasing, but not strictly decreasing. 
(a) f(x) =3a4/3 — 4r, I =[1,00) 

(b) f(x)=x-—logz, I= (0,1] 


The following corollary to Theorem F38 will be useful in later units. 


Corollary F39 Zero Derivative Theorem 


Let f be continuous on an interval J and differentiable on the interior 
of I. If f'(x) = 0 for all x in the interior of J, then f is constant on T. 


Proof Theorem F38(a) and (b) both apply, so f is both increasing and 
decreasing on J. Hence f is constant on T. E 


We can often determine whether a point c such that f’(c) = 0 is a local 
maximum or a local minimum of a function f by using the following test. 


Theorem F40 Second Derivative Test 


Let f be a twice-differentiable function defined on an open interval I 
containing a point c such that f'(c) = 0 and f” is continuous at c. 


(a) If f’(c) > 0, then f(c) is a local minimum of f. 
(b) If f”(c) <0, then f(c) is a local maximum of f. 


Proof We prove part (a); the proof of part (b) is similar, so we omit it. 


®. We first use the ¢-d definition of continuity with € = 5 f” (c) to show 
that f"(x) > 0 for x close to c. A similar technique was used in the proof 
of the Quotient Rule in Subsection 2.1. .@ 


Suppose that f”(c) > 0. Since f” is continuous at c, there exists ô > 0 
such that (c — 6,c+ ô) C I and 


|F") = F'O < $f" (ce), for xe (e—4,c +4), 
so 


f(x) > $f") > 0, for x € (c—6,c+ ô). 
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Thus f’ is strictly increasing on the open interval (c — 6,c + ô), by the 
strict inequalities version of the Increasing—Decreasing Theorem. Since 
f'(c) = 0, we deduce that 


f'(z) <0, for xz €(c—4,c), 
f'(z)>0, for x € (c, c+ ô). 


Thus f has a local minimum at c, by another application of the strict 
inequalities version of the Increasing—Decreasing Theorem. E 


Note that if f”(c) = 0, then the Second Derivative Test gives us no 
information about local extreme values. For example, the function 
f(x) = z? satisfies f'(0) = 0 and f”(0) = 0, but it has neither a local 
maximum nor a local minimum at 0. 


Exercise F30 


Consider the function 
f(z) = £? — 327 +1. 
(a) Determine those points c such that f’(c) = 0. 


(b) Using the Second Derivative Test, determine whether the points c 
found in part (a) correspond to local maxima or local minima, and 
find the values of these local maxima or local minima. 


Proving inequalities 


We now demonstrate how the Increasing—Decreasing Theorem can be used 
to prove certain inequalities involving differentiable functions. 


First we prove a generalisation of Bernoulli’s Inequality. Recall from 
Subsection 3.5 of Unit D1 that Bernoulli’s Inequality states that 


(l+a)”">1l+4+nz, forx>-landneN. 


In the next worked exercise we show that this inequality still holds if we 
replace n by any real number a > 1. 


4 Mean Value Theorem 


Worked Exercise F20 


Let a > 1. Prove that 


(l+2)*>1l+az, forg>-l. 


Solution 


The case œ = 1 holds by Bernoulli’s Inequality, so we can assume that 
a > 1. Define the function 


f(z) =(1+2)*—(1+az) (xe [-1,00)). 


We want to show that f(x) > 0 for x € [—1,00). Since 
f(0) =1—1=0, this is equivalent to showing that 


f(z) > f(0), for z € |-1, o0). 
We do this by showing that 
f is increasing on (0,00) and decreasing on [—1,0). («1) 


Now the function f is continuous on [—1,0o) and differentiable on 
(—1,00), with derivative 


Po oa) a 
=a(+a)%"—-1), for x € (—1,00). («2) 
If x > 0, then 1+ 2 > 1, so 
(Upa >l orel. 


®@. Here we have used Rule 5 for rearranging inequalities from 
Section 2 of Unit D1: 


ihe Gif) 2 (0) Baas ay 0 thema -0 — IS 


We have used this with p=a—1>0. @ 
Hence, by equation (*2), 
Gf (Oss tors = 0, 


so f is increasing on (0,00), by the Increasing—Decreasing Theorem. 
Similarly, if —1 < x < 0, then 0 < 1 +x < 1, so 
(Upa <i, r= <r <0 


Hence, by equation (*2), 
f(z) <0, for-l1<2<0, 


so f is decreasing on (—1,0). This proves statement (*1). 
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Worked Exercise F20 illustrates the following general strategy for using the 
Increasing—Decreasing Theorem to prove inequalities. 


Strategy F7 
To prove that g(x) > h(x), for x € [a,b], carry out the following steps. 
1. Let 


f(x) = g(x) — h(@), 
and show that f is continuous on [a,b] and differentiable on (a, b). 
2. Prove that 
either f(a)>0 and f’(x) >0 for x € (a,b), 
or fio) =O" and (a) = 0 tors = (a,b). 


The diagrams in Figure 20 below illustrate why Strategy F7 works. 


YA 


y = f(z) 


Figure 20 ‘The two cases in Strategy F7 


Remarks 


1. There is a corresponding version of Strategy F7 in which the weak 
inequalities are replaced by strict inequalities. 


2. We can also apply Strategy F7 to intervals of the form | a, oo) if the first 
case in step 2 holds, and to intervals of the form (—oo, b] if the second 
case in step 2 holds. 


3. Notice that in Worked Exercise F20 we used both cases in step 2 of 
Strategy F7. 


Exercise F31 


Prove the following inequalities. 
(a) sing < xz, for x € [0, 00) (b) 4a+4> x?’ , for x € [0,1] 
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’ an’ ’ 
5 L’HO6pital’s Rule 
In Section 1 we found the derivatives of sin and exp by using the results 


sin x . er —1 


x> £ x—0 Sh 


form 


where f and g are continuous functions with f(c) = g(c) = 0. Such limits 
cannot be evaluated by the Quotient Rule for limits, because this rule 
requires lim g(x) 4 0. 

T>e 


There are similar problems with the following more complicated limits: 


cos 3x r? 


lim —— and lim ———. 
r—>r/2 sin g — est z—>0 cosh g — 1 


Do they exist? And if they do, what are their values? 


In this section you will meet a result called l’Hopital’s Rule, which enables 
us to answer such questions. 


5.1 Cauchy’s Mean Value Theorem 


In Section 4 you met the Mean Value Theorem which asserts that, under 
certain conditions, a function defined on a closed interval has the property 
that at some intermediate point, the tangent to its graph is parallel to the 
chord joining the endpoints. To prove |’Hopital’s Rule, we will need the 
following generalisation of the Mean Value Theorem. 


Theorem F41 Cauchy’s Mean Value Theorem 


Let f and g be continuous on [a,b] and differentiable on (a,b). Then 
there exists a point c € (a,b) such that 


f'()\(9() — g(a)) = g'(c)(F@) — F(a); 
in particular, if g(b) Æ g(a) and g/(c) £0, then 
HORM ORO 


g'(c) g(b) — gla) 
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Remarks 


1. Note that Cauchy’s Mean Value Theorem involves two functions defined 


on a closed interval [a,b] and, subject to the stated conditions, gives us 
an expression for the ratio of their derivatives at some point c € (a,b). 
(It is this expression that we need for the proof of l’H6pital’s Rule.) If 
we put g(x) = x, then Cauchy’s Mean Value Theorem reduces to the 
usual Mean Value Theorem. 


. There is a geometric interpretation of Cauchy’s Mean Value Theorem 


that you may find helpful. Recall from Subsection 5.4 of Unit A4 that 
we can describe a curve in R? by specifying the x- and y-coordinates of 
its points using two functions, f and g, to define parametric equations 


r=g(t), y= f(t), 


where the parameter t belongs to some suitable interval fa, b]. Thus we 
can think of the two functions in Cauchy’s Mean Value Theorem as 
defining a curve in this way; see Figure 21. 


Figure 21 A geometric interpretation of Cauchy’s Mean Value Theorem 


Now it can be shown that, subject to certain conditions, the gradient of 
/ 

f (o) (though we do 

g'(c 

not prove this here). Thus, Cauchy’s Mean Value Theorem tells us that 

there is some value c in the interval (a,b) such that the gradient of the 


(b) 


b) — 
curve at the point (g(c), f(c)) is equal to the gradient — of the 


) 
chord joining the endpoints of the curve, (g(a), f(a)) and (g(b), f(b)). 


the tangent to this curve at the point (g(c), f(c)) is 


From 1815 to 1830 Augustin-Louis Cauchy (1789-1857) taught at the 
famous Ecole Polytechnique in Paris, the Ecole founded for the 
training of engineers. Cauchy wrote several textbooks on analysis 
designed for the students there, including his Cours d’Analyse of 1821 
and his Résumé des leçons données a l’Ecole Royale Polytechnique sur 
le calcul infinitésimal of 1823, the latter containing the result now 
known as the Cauchy (or Generalised) Mean Value Theorem. His aim, 
as he explained in the Cours d’Analyse, was to endow proof in 
analysis with the same level of rigour as proof in Euclid’s geometry. 
However, the originality of his lectures within the programme of the 
Ecole was not looked upon favourably, and he was criticised by the 
director of the Ecole who warned Cauchy that ‘It is the opinion of 
many persons that the instruction in pure mathematics is being 
carried too far at the Ecole and such an uncalled-for extravagance is 
prejudicial to other branches of mathematics.’ 


Proof of Theorem F41 Consider the function 
h(x) = f(x)(g(b) — g(a)) — g(x)(f (b) — f (a)). 


By the Combination Rules for continuous functions and for differentiable 
functions, h is continuous on [a,b] and differentiable on (a,b). Also, 


h(a) = f(a)(9(b) — g(a)) — 9(a)(F() — Fla) 
= f(a)g(b) — g(a) f (b) 


so h(a) = h(b). 


Thus h satisfies the conditions of Rolle’s Theorem on [a,b], so there exists 
a point c € (a,b) for which 


he) =0; 
that is, 
f'()\(g(0) — gla)) = 9'(O)(F(0) — f(a) = 0. 


The two equations in the statement of Cauchy’s Mean Value Theorem now 
follow by rearranging this equation. | 
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5.2  L’HO6pital’s Rule and its application 


We are now in a position to prove the main result of this section. 


Theorem F42 L’H6pital’s Rule 


Let f and g be differentiable on an open interval J containing the 
point c, and suppose that f(c) = g(c) = 0. Then 


7 
lim fl) exists and equals lim ie, 
zc g(x) ze g'(x) 
provided that the latter limit exists. 
Proof We assume that 
/ 

lim fa) exists and equals 1, (10) 
rc g x) 


and we want to deduce that 
lim fie) =h 
zc g(x) 
®. We use the -ô definition of limit from Subsection 3.3 of Unit F1. @ 
Let ¢ > 0. Then, by statement (10), there exists ô > 0 such that 
HORE 
g'(x) 
In particular, g'(x) Æ 0, for 0 < |z — c| < ô. 


(11) 


<e, forall x with 0 < |z — c| < ô. (12) 


Suppose now that æ is such that 0 < |x — c| < 6. If g(x) = g(c), then g’ 
must vanish at some point between c and x (by Rolle’s Theorem), which 
we know is not the case. Therefore, g(x) 4 g(c). Thus, by Cauchy’s Mean 
Value Theorem, there exists some point d between c and x such that 


f'(a) _ f(z) - f(c) 
g(d) g(x) — gle) 


= a) (since f(c) = g(c) = 0). 


Thus, by statement (12), we have 


fe) |_| £r@ 
g(a) g(a) 


It follows that statement (11) is true, as required. E 


-1 


<e, for all x with 0 < |æ — c| < ô. 


In the early 1690s the Marquis de l’H6pital (1661-1704) contracted 
Johann Bernoulli (1667-1748) to teach him the recently published 
Leibnizian differential calculus. The result was the first textbook ever 
written on the calculus, l’H6pital’s Analyse des infiniment petits, pour 
l'intelligence des lignes courbes (The analysis of the infinitely small, 
for the understanding of curved lines), published in Paris in 1696. It 
contains what it is now known as |’Hopital’s rule, although |’Hopital 
learnt the rule from Bernoulli. When l’H6pital published his Analyse 
he acknowledged all the instruction he had received from Bernoulli 
but in such a way that Bernoulli took offence. 


We can use |’Hopital’s Rule to evaluate the two complicated limits 
mentioned in the introduction to this section. 


Worked Exercise F21 


Prove that 
. cos 3x 
im ——————— 
r—r/2 sin g — esT 


exists, and determine its value. 


Solution 
Let J = R and define 


f(z) =cos3z and g(x) =sinz—e** (ER). 


Then f and g are differentiable, and f(7/2) = g(a/2) = 0; hence f 
and g satisfy the conditions of l’Hopital’s Rule at the point x = 7/2. 
Now the derivatives of f and g are 


COS & 


f'(x)=—3sin3z and g'(r)=cosxr+e* sing (x ER). 


Since g/ (7/2) = 1 #0, and f’ and g’ are both continuous, we deduce, 
by the Combination Rules for continuous functions, that 


fila) _ f(r /2) 
Shaul = 
z—>r/2 g'(x)  g'(7/2) 
_(3)xC)_, 
iL 
Thus, by ’Hopital’s Rule, the required limit exists and 


cos 3x 


I e 
aa Sn ay = E 
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The next example requires two applications of l’Hopital’s Rule. 


Worked Exercise F22 


Prove that 
lim —————— 
z—>0 cosha — 1 


exists, and determine its value. 


Solution 
Let J = R and define 
f(z)=a? and g(x)=coshz—1 (x ER). 
Then f and g are differentiable, and f(0) = g(0) =0; hence f and g 
satisfy the conditions of l’Hopital’s Rule at the point x = 0. 
Now the derivatives of f and g are 
f'(x)=22 and g/(z)=sinhz (@ ER). 


Thus, by ’Hopital’s Rule, the required limit exists and equals 
Dee 


wm 
z—>0 sinha’ 


provided that the limit (x) exists. 
®. Since g'(0) = 0, we cannot assert that 
ti uf 
0 

sm £@) _ £0), 

x40 g'(x)  g'(0) 
So we try to apply l’Hdpital’s Rule a second time. ® 
Both f’ and g' are differentiable, and f’(0) = g'(0) = 0; hence f’ and 
g' satisfy the conditions of l’Hdépital’s Rule at the point x = 0. Now 

f'(@)=2 and “¢'@)=coshe eR). 


Thus, by lHôpital’s Rule, limit (*) exists and equals 


lim ——— 
z—0 cosh x’ 
provided that this limit exists. But the function cosh is continuous 


on R, and cosh0 = 1. Thus, by the Quotient Rule for continuous 
functions, 
2 2 


lim =- =2, 
z—0 cosh g il 


Working backwards, we conclude that limit (*) exists and equals 2, so 


r2 


im == = f}, 
z>0 coshg— 1 
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Before asking you to apply l’Hopital’s Rule for yourself, we emphasise that 
you should always check carefully that its conditions hold, because a 
careless application can easily give an incorrect answer, as you will see in 
the following worked exercise! 


Worked Exercise F23 


Explain why the following proof is incorrect and find the correct value for 
the limit. 


Claim (incorrect!) 
227 -x-1 = 


lim —.—_— = 2. 
z—1 T =g 


Proof (incorrect!) Let I = R and define 
f(£)=2r° -x-1 and g(z)=z°-— z 


Then f and g are differentiable on R, and 
f(1) = g(1) = 0. 
So, by lHôpital’s Rule, 
f'(x) 
im 
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Exercise F32 


Prove that the following limits exist, and evaluate them. 


i z 1/5 _ (4 — »\1/5 
G ia sinh(« T) iia (1+ 2) (1-2) 
con = sin 3x x0 (1 + 2xr)2/5 — (1 — 2x)?/5 
sin(x?) . sing —xcosz 
w Ta MAS 
Summary 


In this unit you have met a formal definition of what it means for a 
function to be differentiable at a point and seen that, if a function f is 
differentiable at a point c, then the graph of f has a tangent at the point 
(c, f(c)). You have also seen how to use the definition to obtain the 
derivatives of some basic functions, and met many rules — in particular, the 
Glue Rule, the Combination Rules, the Composition Rule and the Inverse 
Function Rule — that enable us to show that many more functions are 
differentiable and to determine their derivatives. 


You have studied the properties of functions that are differentiable on an 
interval, and seen that local minima and maxima occur at places where the 
derivative is zero. You have met Rolle’s Theorem and seen how this is used 
in the proof of the Mean Value Theorem. This has useful corollaries such 
as the Increasing—Decreasing Theorem, which says that if the derivative of 
a function is positive then the function is increasing, while if the derivative 
is negative then the function is decreasing. This in turn enables us to use 
the second derivative test to determine whether a local extreme value is a 
maximum or minimum. Finally, you learnt how to use l’Hopital’s Rule to 


evaluate many limits of the form lim ce where f and g are differentiable 
re g(x 


functions with f(c) = g(c) = 0. 


Learning outcomes 


Learning outcomes 


After working through this unit, you should be able to: 


e explain what is meant by a differentiable function, and understand the 
geometric significance of differentiability 


e determine, using the definition, whether or not a function is 
differentiable at a point 


e explain what is meant by a second derivative and a higher-order 
derivative 


e explain what is meant by the left derivative and the right derivative of a 
function at a given point 


e state and use the Glue Rule for differentiation 
e use the table of standard derivatives 


e use the rules for differentiation to prove the differentiability of a 
particular function and to calculate its derivative 


e state and use the Local Extreme Value Theorem 
e state and use Rolle’s Theorem 
e state and use the Mean Value Theorem 


e state and use the Increasing—Decreasing Theorem and the Zero 
Derivative Theorem 


e understand the statements of Cauchy’s Mean Value Theorem and 
VH6pital’s Rule 
f(x) 


e use lHôpital’s Rule to evaluate certain limits of the form lim ——, 
zc g(x) 
where f(c) = g(c) = 0. 
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Table of standard derivatives 


a” loga 


f(z) f'(x) 

k 0 

x 1 

z”, nEZ— {0} ng”! 

z% acER ara! 

a”, a>0 

sing cos & 

Cos & — singz 

tan x sec? x 
cosec £ — cosec x cot x 
sec T sec x tan x 
cot £ — cosec? x 
sinha 1/V1 = 2? 
cos’! —1//1— 2? 
tan! 

er e” 

log x I/z 

sinh x cosh x 
cosh x sinh x 
tanh x sech? z 
sinh™! z 1/ V1 4+ 22 
cosh”! x 1/V22 -1 
tanh! z 1/(1 — 2?) 
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Domain of f’ 


R 

R 

R or R— {0} 

Rt 

R 

R 

R 
R—{(n+5)m:neZ} 
R—{na:neZ} 
R—{(n+5)m:neZ} 
R—{na:neZ} 


(=1,1) 
(=i, 1) 


“ig 1/(1 + 2°) 


Solutions to exercises 


Solution to Exercise F15 


(a) The difference quotient for f at c, where 
c#0, is 


guje Ler WS 


=] 
= here h Æ 0. 
(c+ h)c’ ee 
Thus Q(h) > —1/c? as h + 0. Hence f is 
differentiable at c, with f’(c) = —1/c?. 
(b) The difference quotient for f at 0 is 


on = LO =10) 


_ h? cos(1/h) — 0 
E h 
= hcos(1/h), where h 40. 
Now, |cos(1/h)| < 1 for h Æ 0, so 
IQh)| < |h], for h #0. 


Thus Q(h) > 0 as h > 0, by the Squeeze Rule for 
limits. Hence f is differentiable at 0, with 


f'(0) =0 
(c) The difference quotient for f at 0 is 


ati) = A= 10 


_ lhl =0 


h>0, 
h<0. 


Now consider the two null sequences 


1 
and k,=—-, 
n 


il 
hn = = ial le ee 
n 
These sequences have non-zero terms, and 
Q(hn) =1731 asn- oœ, 
but 


Q(kn) = —1 > —1 as n > oo. 


Solutions to exercises 


Since these limits are different, f is not 
differentiable at 0. 


(d) The difference quotient for f at 0 is 


f(h) = FO 
any = = 10 
|h" sin(1/h) — 0 
7 h 
sin(1/h) 
nila” h> 0, 
E sin(1/h) 
Ape? h < 0. 
Now consider the null sequence 
Rn = —— WH 1D xe 
(2n + 5) T 
which has positive terms. This gives 
sin(1/hn) 
Q(hn) = ye 


= (2n + ae nt? sin(2n + t)r 
= (2n + 4)? 72 => œ as n — o. 


Hence f is not differentiable at 0. 


Solution to Exercise F16 
The graph of y = f(x) is given below. 


RY 


(This is included to aid your understanding — you 
do not need to sketch the graph as part of your 
solution.) 


Let J = R and define 


g(x) =—a* (ER) and A(z)=2? (rE R). 
Then 

f(2)=9@),. tora <0, w 

Je=— ha). forg >00; 
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so condition 1 of the Glue Rule holds (with c = 0). 


Furthermore, f(0) = g(0) = h(0) = 0, so 
condition 2 holds, and g and A are differentiable 
with 
g'(x) = —2x 
so condition 3 holds. 


Since g'(0) = h’(0) = 0, it follows from the Glue 
Rule that f is differentiable at 0 and f’(0) = 0. 


(x ER) and R(x) = 2x 


Also, by equations (*), 
f'(x) = g'(x) = —2x, 
f'(&) =W (2) = 2x, 

since differentiability is a local property. 


Hence f is differentiable (on R), and 


for x < 0, 
for x > 0, 


=2 <0, 
fi(zj= 4 0, r=0, 
22, 2 > 0. 
Thus 
f'(x) =2\x| (we R). 


Solution to Exercise F17 


In each case we use the Combination Rules. 


(a) f'(r) = 72° — 823 +927-5 (xeR) 
proa a -e ee Ss 

= ae (x € R— {1}) 
(c) f'(x) = cos? x — sin? x 

=cos2x (x €R) 
(d) f'(x) 


_ (3 +sinzg — 2 cos g)e” — e” (cos x + 2sin x) 


(3 + sin x — 2 cos x)? 
e*(3 — sin x — 3 cos x) (2 ER) 
— LE 
(3 + sin x — 2 cos x)? 


Solution to Exercise F18 


We have 
f(a) = e + ze? = e?(1 + z), 
f'(a) =e(1 +2) +e = e(24 2), 
f(x) = e (2+ x£) +e = e” (342). 
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(x € R), 


Solution to Exercise F19 


In each case, we use the Quotient Rule and the 
derivatives of sin and cos. 


(a) f(x) = tanz = sin z/cos z, so 


cos x cos x — sin x(— sin x) 


f(a) = 


cos? x 


=— = sec? x 
cos? x 
on the domain of f. 
(b) f(z) = cosec x = 1/sin x, so 
cos £ 
f'(2) A 2 
sinf x 
= — cosec x cot x 


on the domain of f. 


(c) f(z) = secz = 1/cosz, so 


f(a) = cos? x 


= sec qz tan x 


sin x 


on the domain of f. 
(d) f(x) = cot x = cos xz /sin x, so 
sin z(— sin x) — cos £ cos £ 


f(z) = a) 


sinf x 
1 


= =e = —cosec? x 
sin* x 


on the domain of f. 


Solution to Exercise F20 
In each case we use the Combination Rules. 
(a) f(z) =sinha = $ (e? —e~*), so 
F(z) = zle +e) 
= cosh z. 
(b) f(x) = coshz = $(e” + e7”), so 
f'(x) = z(e —e™*) 
= sinha. 
(c) f(x) = tanh z = sinh z/cosh z, so 
f'(2) = 


cosh x cosh x — sinh z sinh x 
cosh? z 
1 


= z = sech? x 
cosh’ x 


Solution to Exercise F21 
(a) f(z) = sinh(z’), so 
f' (x) = 2z cosh(a?). 
(b) f(x) = sin(sinh 27), so 
f'(x) = 2cos(sinh 2x) cosh 27. 
cos 2x 


(©) Ala) =sin( S 


) @€ 000), 


so on this interval 


cos 2x x? (—2sin 2x) — 2x cos 2x 
fæ) = cos( z2 )( ( ) ) 


2 2 
= —— (z sin 2x + cos 2x) cos( z), 
£ 


T 


r2 


Solution to Exercise F22 

(a) The function 
f(x) =cosx (a € (0,7)) 

is continuous and strictly decreasing, and 


f((0,7)) = (1, 1). 


Also, f is differentiable on (0,7), and its derivative 
f'(x) = — sin z is non-zero there. 


Thus f satisfies the conditions of the Inverse 
Function Rule. 
Hence f has an inverse function f~! and 
f-! =cos"! is differentiable on (—1,1). If 
y = f(x) = cos x, then 
1 1 

fl À yY = ———— = 

TOE 7 
Since sin z > 0 on (0,7) and sin? x + cos? x = 1, it 
follows that 


sing = y 1 — cos? z = y1 — y2, 


sina 


so 
—1 


J1—y 


Replacing the domain variable y by x, we obtain 


(fy) = 


(cos~+)'(a) = = (x € (—1,1)). 


Solutions to exercises 


(b) The function 


f(x)=sinhz (x €R) 


is continuous and strictly increasing, and 

f(R) =R. 

Also, f is differentiable on R, and its derivative 
f'(x) = cosh z is non-zero there. 


Thus f satisfies the conditions of the Inverse 
Function Rule. 
Hence f has an inverse function f~! and 


f=! = sinh™! is differentiable on R. If 
y = f(x) = sinh z, then 
1 1 
J: 4 yY = ——— = 
YO = 
Since cosh x > 0 on R and cosh? x = 1 + sinh? z, it 
follows that 


cosha = vV 1 + sinh? zr = y1 +42, 


cosh z` 


so 
1 


Jae 
Replacing the domain variable y by x, we obtain 
1 
Ve 


Solution to Exercise F23 


(Ya) = 


(sinh—!)/(x) = (x ER). 


(a) If xı < xo, then x? < 28, so f(x1) < f (a2). 
Thus f is strictly increasing and continuous on R, 
and f(R) =R. Also 


f'(x) =52*+10, 


Thus f satisfies the conditions of the Inverse 
Function Rule. Hence f has an inverse function 
f~t which is differentiable on R. 

(b) Now, f(0) = —1, f(1) = 1 and f(-1) = -3. 
Hence, by the Inverse Function Rule, 


for x ER. 
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Solution to Exercise F24 
By definition, 
f(z) = x” =exp(zlogr) (x €R*). 


The functions x — x and x +> log x are 
differentiable on R”, and exp is differentiable on R. 
It follows by the Product Rule and the 
Composition Rule that f is differentiable on RT, 
and that 
f'(x) = exp(z log x) (log x + x x (1/z)) 
=x"(logr+1) («€R?*). 


Solution to Exercise F25 


Since the functions sin and cos are continuous and 
differentiable on R, so also is f, by the 
Combination Rules. 


Now, 

f'(x) = 2sin z cos x — sin x = sinx(2cosz — 1); 
thus f’ vanishes on (0,7/2) when cosg = 4, that 
is, when x = 7/3. 

Since f(0) = 1, f(t/2) = 1 and 

f(m/3) = (V3/2? +3 =3 +353 

it follows that on [0, 7/2]: 


the minimum of f is 1, occurring at x = 0 and 7/2; 


the maximum of f is 5 occurring at ea fd: 


Solution to Exercise F26 

Since f is a polynomial function, f is continuous 
on [1,3] and differentiable on (1,3). Also, f(1) = 2 
and f(3) = 2, so f(1) = f(3). 

Thus f satisfies the conditions of Rolle’s Theorem 
on [1,3], so there exists c in (1,3) such that 

f'(c) =0. 

(In fact, since 


f' (a) = 42? — 122 + 6x = 2a(2x? — 6x + 3), 


and 2x? — 6x + 3 = 0 for z = 4(3 + V3), we have 
c= $(3+ V3) ~ 2.37.) 
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Solution to Exercise F27 
(a) No: f is not defined at 7/2. 
(b) No: f is not differentiable at 1. 


(c) Yes: all the conditions are satisfied. 


(d) No: f(0) 4 f(/2). 


Solution to Exercise F28 


The function f(x) = xe” is continuous on [0,2] and 
differentiable on (0,2) by the Product Rule. Thus 
f satisfies the conditions of the Mean Value 
Theorem on {0, 2]. 


Now, 
f(2)-f) _ 2e7-0 _ 2 
2—0 2 f 
Thus, by the Mean Value Theorem, there exists a 
point c in (0,2) such that f’(c) = e?. 


Solution to Exercise F29 


(a) The function f is continuous on J and 
differentiable on the interior of J and so we can 
apply the Increasing—Decreasing Theorem. We 
have f'(x) = 4x! — 4 = 4(7/3 — 1). Thus 

f'(x) > 0 for x € (1,00), so f is strictly increasing 
on [1,0o), by the strict inequalities version of the 
Increasing—Decreasing Theorem. 


(b) The function f is continuous on J and 
differentiable on the interior of J and so we can 
apply the Increasing—Decreasing Theorem. We 
have f'(x) =1—1/x = (x — 1)/x. Thus f’(x) <0 
for x € (0,1), so f is strictly decreasing on (0, 1], 
by the strict inequalities version of the 
Increasing—Decreasing Theorem. 


Solution to Exercise F30 
(a) We have 
f (£) = 32? — 62 = 32(a — 2). 


Thus f'(x) = 0 for x = 0 and 2, so the required 
values of c are 0 and 2. 


(b) We have 
f(x) = 6x — 6, 


sO 
f’(0)=-6<0 and f"(2)=6>0. 
Also, f(0) = 1 and f(2) = —3. 


Since f is a twice-differentiable function defined on 
R and f” is continuous on R, it follows from the 
Second Derivative Test that f has a local 
maximum of 1 at x = 0 and a local minimum of —3 
at 7 = 2. 


Solution to Exercise F31 
In each case we follow the steps in Strategy F”. 
(a) 1. Let 

f(z) =ax—-sinz (æ € [0,o00)). 


Then f is continuous on [0,00) and differentiable 
on (0,00). 
2. We have 

f'(z) =1-—cosx>0, 
and f(0) =0. 


Thus f is increasing on [0, 00), by the 
Increasing—Decreasing Theorem, so 


for x € (0,00), 


f(x) > f(0)=0, for x € (0,00). 
Hence 

sing <x, for x € (0,00). 
(b) 1. Let 


i= za + - — g?” (x € [0,1]). 
Then f is continuous on [0,1] and differentiable 
on (0, 1). 
2. We have 
/ = 2 2 ,—1/3 
Fe) =3-3a 
= 3(1- a3) <0, for x € (0,1), 
and f(1) = 3+4%—-1=0. 
Thus f is decreasing on [0,1], by the 
Increasing—Decreasing Theorem, so 


f(x) > f(1)=0, for x € [0,1]. 


2x + = > r, for x € [0,1]. 


Solutions to exercises 


Solution to Exercise F32 
(a) Let I = R and define 


f(x) =sinh(z-—7) (x eR) 

and 
glx) =sin3x (xzeR). 

Then f and g are differentiable on R, and 
f(r) = g(r) = 0. 


Thus f and g satisfy the conditions of l’H6pital’s 
Rule at the point x = 0. 


Now, 
f'(x) = cosh(x — r) 


Since g/(7) = —3 £0, and f’ and g’ are both 
continuous, we deduce, by the Combination Rules 
for continuous functions, that 


f(z) _ fir) 1 1 
ron g'(x) g(a) -3 2 
Thus, by l’Hôpital’s Rule, the required limit exists 
and equals —. 
(b) Let I = (—3, 5): say, and define 


f(x) =(1+2)/-(1-—2)¥5 


and g'(x) = 3cos3z. 


and 
g(x) = (1+ 2z)” — (1 — 2x)?” 


on I. (The only requirements when selecting the 
open interval J are that it must contain O and lie in 
the domains of both f and g; that is, all x € I 
must satisfy 1 — 2x > 0 and 14 2x > 0.) 


Then f and g are differentiable on J and 
f(0) = g(0) =0. 


Thus f and g satisfy the conditions of l’ Hôpital’s 
Rule at the point x = 0. 


Now, 


fiz) =40 42) 4"? + 30-2) 


g' (x) = $(1 +21)? + 4(1 — 22)-3/*. 
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Since g'(0) = $ + $ #0, and f’ and g' are both 
continuous, we deduce, by the Combination Rules 
for continuous functions, that 


f(z) FO _5+3 _1 
ro ga) gO) $48 © 
Thus, by ?Hopital’s Rule, the required limit exists 
and equals i: 


(c) Let J = R and define 


f(z) = sin(x?) and g(z)=1—cos4¢ (x ER). 


Then f and g are differentiable on R, and 

f(0) = g(0) = 0. 
Thus f and g satisfy the conditions of l’ Hôpital’s 
Rule at the point z = 0. 


Now, 
f'(z) = 2zcos(zx?°) and g’(x) = 4sin4r. 


Thus, by ?Hopital’s Rule, the required limit exists 
and equals 
/ 

lim f(a) («1) 
provided that limit (*1) exists. Here 
f’(0) = g’(0) = 0, so we cannot apply l’Hopital’s 
Rule at this stage. However, f’ and g’ are 
differentiable on R, so f’ and g’ satisfy the 
conditions of l’H6pital’s Rule at the point x = 0. 


Now, 

f" (æ) = — (2x)? sin(x?) + 2 cos(x?) 
and 

g” (x) = 16 cos 42. 


Thus, by lHôpital’s Rule, the required limit exists 
and equals 
1 

im i) (*2) 

z>0 g (x) 
provided that limit (+2) exists. 
Since g”(0) = 16 40, and f” and g” are both 
continuous, we deduce, by the Combination Rules 
for continuous functions, that 


f(a) F'O) _ 2 _ı 


0 g”(æ) g"(0) 16 ® 


Hence limit (x2) exists and equals £. 
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Thus limit (x1) exists and equals $, so the required 


limit also exists and equals +. 


8 
(d) Let J = R and define 
f(z) =sinzx—axcosz and g(x)=2° (« €R). 


Then f and g are differentiable on R, and 
f(0) = g(0) = 0. Thus f and g satisfy the 
conditions of l’H6pital’s Rule at the point « = 0. 


Now, 

f'(z)=asinz and g'(x) = 32’. 

Thus, by ’Hopital’s Rule, the required limit exists 
and equals 


f'(@) 


240 g'(x) 


h] 


provided that this limit exists. But 


xsin g i 
lim = im 4 = = lim = =, 
r>0 3a 3250 T 3 


Hence, the required limit exists and equals z. 


Unit F3 
Integration 


1 The Riemann integral 


Introduction 


In this unit you will study the question: 


What do we mean by the area between the graph of a real function f 
and the x-axis? 


You will see how this can be answered by trapping the required area 
between increasingly accurate lower and upper estimates, each of which is 
the sum of the areas of suitably chosen rectangles. The area between the 
graph y = f(x) and the segment [a,b] of the x-axis is defined to be the 
supremum of the lower estimates and the infimum of the upper estimates, 
as long as these two values are equal. In this case, we call the common 
value the integral of f on [a,b], written 


fi or f toa 


You will see that, for many functions, we can evaluate integrals by using 
the Fundamental Theorem of Calculus, which allows us to think of 
integration as the inverse operation of differentiation. Although we will 
review techniques of integration, our main focus in this unit is on 
providing a rigorous foundation for the idea of integration, and on showing 
how this relates to concepts you have met in previous analysis units. 


Often it is not possible to evaluate an integral explicitly, and later in the 
unit you will meet methods for obtaining upper and lower bounds for the 
integral in such cases. You will also see how we can apply integration to 
derive some remarkable formulas for m and for estimating factorials, and to 
give a useful additional test for the convergence of certain series. 


1 The Riemann integral 


The purpose of this section is to give a rigorous definition of what we mean 
by the area between the graph 


y= f(x) (x€ [a,b]) 


and the closed interval [a,b] on the x-axis, and to explore its implications. 
We begin with an informal discussion to set the scene. 
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We have an intuitive notion of area. There are formulas for calculating the 
areas of simple geometric shapes such as rectangles and triangles, and we 
would certainly want our rigorous definition to agree with these. We would 
probably also agree that the region between the z-axis and the graph of a 
continuous function defined on a closed interval [a,b] has a definite area, 
even if we are uncertain how to measure it. On the other hand, for 
discontinuous functions it is not obvious that we can always say that the 
region between a graph and the z-axis has an area. For example, can we 
define such an area for the function 


zx’, O0<2<1, 
2 1<r<2, 


f(z) = 


which has a discontinuity at the point x = 1? The graph of f is illustrated 
in Figure l(a). 


YA YA y=1, xrEQ 
24 o EEN concwosevsantuaectaciace 
y = f(z) y = g(x) 
1 
|_ 7 aad Ste 
l m> —QeeeeeesooeeeeeoeosooeeeeoeeO—> 
0 1 2 T pg 


(a) (b) 


Figure 1 The graphs of the discontinuous functions f and g 


As another example, can we define the area between the x-axis and the 
graph of the Dirichlet function on the closed interval [0,1], illustrated in 
Figure 1(b)? You met the Dirichlet function in Subsection 3.2 of 

Unit F1 Limits. Its rule is 


1, O0O<a< 1, z rational, 
g(x) = 


0, O<a2 <1, zx irrational, 


and in Unit F1 it was shown to be discontinuous at every point of its 
domain. 


It seems desirable that our definition of area should cover a wide range of 
functions. Later in this section we will prove that we can always assign a 
value to the area between the graph and the z-axis for a continuous 
function defined on a closed interval. You will also see that we can do the 
same for the function illustrated in Figure l(a), but not for the function 
illustrated in Figure 1(b). 


Our definition of area is based on finding lower and upper estimates for the 
‘area’ (if it exists) of the region between the graph y = f(x) and the 
x-axis, using the following approach. First we divide the interval fa, b] into 
a set of subintervals, called a partition of [a,b]. Then we consider two sets 
of rectangles, each rectangle having one of the subintervals as its base. In 
one set, we choose rectangles whose top edges lie on or below the graph, so 


the sum of their individual areas forms a lower estimate for the ‘area’ of 
the region; see Figure 2(a). In the other, we choose rectangles whose top 
edges lie on or above the graph, so the sum of their individual areas forms 
an upper estimate for the ‘area’; see Figure 2(b). 


Ys y= f(z) WwW y= f(z) 


o~ 
Ry 

~ 
RY 


(a) (b) 


Figure 2 (a) A lower estimate and (b) an upper estimate for the area of the 
region between the graph y = f(x) and the z-axis 


In this way we can obtain many lower estimates and upper estimates by 
choosing different partitions of [a,b]. If there is a real number A with the 
properties 


the supremum of the lower estimates = A 
and 
the infimum of the upper estimates = A, 


then we define A to be the area between the graph and the x-axis. We call 
the number A the integral of f on [a,b], and denote it by 


[i or [ fon 


We make all these ideas precise in the rest of this section, which is the 
longest and hardest section of the unit. On a first reading, you may wish 
to try to understand the main ideas without following every detail. The 
details may be easier to understand on a second reading. 


1.1 Definition of the integral 


In this subsection we work towards giving a rigorous definition of the area 
between the graph of a function f defined on a closed interval [a,b] and the 
x-axis; that is, the integral of f on [a,b]. 


Before we can give the definition, we need to introduce a number of key 
ideas. In the paragraphs below you will study: 


e some important terminology for functions 
e what is meant by a partition of a closed interval 


e the use of lower and upper Riemann sums to estimate areas. 
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Terminology for functions 


In Unit D1 Numbers you met the definitions of lower bound, greatest lower 
bound, upper bound and least upper bound of sets in R. Here you will 
meet analogous definitions for functions defined on an interval in R. These 
definitions of greatest lower bound and least upper bound generalise the 
notion of the minimum and mazimum of a function which you met in 
Unit D4 Continuity. The minimum and maximum of a function are 
illustrated in Figure 3 and we give a reminder of their definitions (in a 
slightly different form from those you saw in Unit D4) together with the 
definitions of lower and upper bounds for functions. 


YA YA 
M- 


(a) (b) 


Figure 3 (a) The minimum of a function (b) The maximum of a different 
function 


Definitions 


Let the function f be defined on the closed interval [a,b]. Then the 
following hold on fa, b]. 
e f is bounded below on [a,b] with m as a lower bound if 
f(z) =m, for all z € fa,b]. 
e mis the minimum of f on [a,b] if 
1. m is a lower bound for f on [a,b], and 
2. f(c) = m, for some c € [a,b]. 
Thus m = min{ f(x) : a <x < 6}, which we also write as min f or 
simply as min f. le 
e f is bounded above on [a,b] with M as an upper bound if 
f(z) <M, for all z € [a,b]. 
e M is the maximum of f on [a,b] if 
1. M is an upper bound for f on [a,b], and 
2. f(d) = M, for some d € [a,b]. 
Thus M = max{f(x) : a <x < b}, which we also write as max f or 
simply as max f. [as 


e f is bounded on [a, }] if it is both bounded below and bounded 
above on [a,b]. 


1 The Riemann integral 


Note that any lower bound for a function f on [a,b] is less than or equal to 
any upper bound for f on [a,b]. 


A function f that is continuous on a closed interval [a,b] necessarily has 
both a minimum and a maximum, by the Extreme Value Theorem for 
continuous functions, as you saw in Subsection 3.3 of Unit D4. However, if 
f is not continuous on [a,b], then it may or may not have a minimum or a 
maximum on [a,b]; for example, the function in Figure 4(a) has neither a 
minimum nor a maximum on |a, b], whilst the function in Figure 4(b) has a 
maximum but no minimum on [a,b]. 


w= f(z) 


Figure 4 Two bounded functions on [a,b]: (a) a function with neither a 
minimum nor a maximum (b) a function with a maximum but no minimum 


Both functions in Figure 4 are certainly bounded on [a,b]; and there are 
numbers m and M, as shown, such that m < f(x) < M for all x € fa, b]. 
However, although the first function f takes values as close as we please to 
m and M, there is no point x in [a,b] where f(x) =m or M; and although 
the second function f takes values as close as we please to m, there is no 
point x in [a,b] where f(x) =m. This suggests the notions of greatest 
lower bound and least upper bound. 


Definitions 
Let the function f be defined on the closed interval [a,b]. Then 
e m is the infimum or greatest lower bound of f on {a, }] if 
1. m is a lower bound for f on [a,b], and 
2. if m >m, then f(c) < m’, for some c € [a,b]. 


Thus m = inf{ f(x) :a@< x < b}, which we also write as inf f or 


la, 


simply as inf f. 

e M is the supremum or least upper bound of f on [a,b] if 
1. M is an upper bound for f on |a,b], and 
2. if M’ < M, then f(d) > M’, for some d € fa, b]. 


Thus M = sup{ f(x): a <x < b}, which we also write as sup f or 
[a,b] 


simply as sup f. 
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Remarks 


1. Note from the definition that the concepts of the infimum of a real 
function and the infimum of a set of real numbers are related in the 
following way: the infimum of a function is the infimum of the image set 
of the function. A similar remark applies to supremums. 


2. Any function f that is bounded on [a,b] necessarily possesses an 
infimum and a supremum on [a,b]: if m is any lower bound of f on 
a,b], then inf f > m and if M is any upper bound of f on [a,b], then 


sup f < M. 
3. If min f exists, then inf f = min f. Similarly, if max f exists, then 
[a,b] [a,b] [a,b] [a,b] 
sup f = max f. 
[a,b] [a,b] 


Worked Exercise F24 


Consider the function 


rozl =l] <z <1, 


a—a2, 1<a<2. 


Sketch the graph of f and identify min f, max f, inf f and sup f (if they 
exist). 
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For each of the following functions f on [—1, 1], sketch the graph of f and 
identify min f, max f, inf f and sup f (if they exist). 


Partitions of a closed interval 


We now introduce the notion of a partition of a closed interval, which will 
play an important role in estimating the area between the graph of a 
function and the x-axis. 


1 The Riemann integral 
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Definitions 
A partition P of a closed interval [a,b] is a collection of a finite 
number of closed subintervals of [a, b], 
PN riper ile knalo a ki Pihos a ete 
where 
m= << a Koo Ky Ks KH, = 0. 
The points z;, O0 < i < n, are called the partition points of P. 


The ith subinterval is [z;—1, x;], 1 < i < n, and its length is 
denoted by 62; = xj — 24-1. 


The mesh of P is the quantity || P|] = max Oda 
<i<n 


A standard partition is a partition with subintervals of equal 
length. 


These definitions are illustrated in Figure 5. 


P 
oo Qccc 
a=% tı oes Wed Diy we Ln-1 Ln =b 
Ox; 
{> 


Figure 5 A partition P of an interval [a,b] 


Worked Exercise F25 


Let P be the partition of [0,1] given by 


P= {(0, 5], [5,3]. [5, a], [2 1 }- 
Find the mesh of P. 
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Exercise F34 


Write down the standard partition P of [—1,2] that contains 
four subintervals, and state the mesh of P. 


Lower and upper Riemann sums 


Next, we introduce the lower and upper Riemann sums for a bounded 
function f on an interval [a,b] with partition P; these correspond to 
underestimates and overestimates in our intuitive notion of the area 
between the graph y = f(x) and the z-axis. 


Definitions 


Let f be a bounded function on [a,b], and let P be the partition 
Lona Lans 2al 02 eee per Pomi Zol wie ng = 0 amd 
i, = lo, et 


mog = w f) s w E be seal 
and 

Iy = aoo fcr) oo E oi l 
for i = 1,2,...,n. 


Then the lower Riemann sum for f on [a,b] with partition P is 
n 
L(G) = eran 
i=1 
and the upper Riemann sum for f on [a,b] with partition P is 


OG = > Mada 
j= 


Note that the above definitions work equally well whether f takes positive 
or negative values in fa, b]. Regions between the graph y = f(x) and the 
x-axis where f takes negative values make a negative contribution to the 
lower and upper Riemann sums. In this subsection we will in general 
illustrate results for functions that are non-negative throughout [a,b], but 
we give further consideration to functions that are negative on all or part 
of the interval [a,b] in Subsection 1.3. 
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The terms m; and M; in the definitions denote the greatest lower bound 
and least upper bound of f on the ith subinterval of the partition; we need 
to use the infimum and supremum on the subintervals since the function f 
may not be continuous and so may not have a minimum or maximum on 
all (or any) subintervals. The lower Riemann sum is the sum of the areas 
of the rectangles with height m; and width ôx;, giving a lower estimate for 
the area between the graph and the z-axis, as illustrated in Figure 6(a). 
The upper Riemann sum is the sum of the areas of the rectangles with 
height M; and width 62;, giving an upper estimate for the area between 
the graph and the z-axis, as illustrated in Figure 6(b). 


YA YA 


y = f(x) 


i > > 
Lo Tı £2 £3 +++ Ln-1 Tn T Lo Ti £2 T3 +++ Ln-1 Tn T 


(a) (b) 


Figure 6 The rectangles whose areas are included in (a) the lower Riemann 
sum L(f, P) and (b) the upper Riemann sum U(f, P) 


Now, on any interval, the greatest lower bound of a function f is 
necessarily less than or equal to its least upper bound. It follows that, in 
each subinterval [x;—1, xi], we have m; < Mi. Summing from i = 1 to n, we 
obtain the following result. 


Theorem F43 


Let f be a bounded function on [a,b], and let P be a partition 
of [a,b]. Then 


L(f,P) < Uf, P). 


Worked Exercise F26 


Let 


20 O<a<il, 
f(z) = 
1, «=0,1, 


and let 
P= {(0, 4], [4,3], [> 4]> [451] } 
0, 


be a partition of [0,1]. Determine L(f, P) and U(f, P). 
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Solution 


®. It is helpful to make a sketch of the graph of f and the partition of 
the interval [0, 1]. 


0 T 3 
2 4 


í 
4 
Using the notation in the definitions, we set out the information 
needed to calculate the lower and upper Riemann sums using a layout 
that will make the calculation straightforward. Note that this 
function is increasing except at x = 0 and at x = 1, so we have to take 
special care at these points: the values of Mı and m4 are not what 
you might expect, and the values of mı and M4 are not taken by the 
function. © 


For the four subintervals in P, we have 


mı = 0, Mı = f(0) = , 6x1 T 
a 
m3 = f($ =1, M3 = f(#) = 8, 6x3 = Fe 
m=) M= 2, D E 
Then 
4 
L(f,P) =) mid 
i=l 
T a) xy) 
=O+}+3+4 
-Š 
8 
and 
4 
U(f,P) =>) Mid; 
j=l 
= (1x4) 


+ (1x a) + (2% 4) + (2% 3) 
3 
3 
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Exercise F35 


Let 

Seat 
and let 

P= {[0,3]; [5-2} [at 


1 
5? 
be a partition of [0,1]. Determine L(f, P) and U(f, P). 


Worked Exercise F26 and Exercise F35 addressed the same function f on 
the same interval [0,1] but with different partitions. The two lower sums 
were 5/8 and 31/50, and the two upper sums were 11/8 and 3/2; each of 
the two lower sums is smaller than both of the two upper sums. 


We now look at another example to see whether this happens again in a 
different situation. We make use of the following formula for a sum of 
squares which you may have met in your previous studies (it can be proved 
by mathematical induction): 


1)(2 1 
POF + +n? -y = Mat Dene). for = l 2 ieee: 


Worked Exercise F27 


Let 


x, O0<¢< 1, 
a. 1<xr<2, 


and, for each n € N, let 
Pon = {[0, 5], [oah o [2 21 


be the standard partition of [0,2] into 2n equal subintervals. Determine 
L(f, Pon) and U(f, Pan). 
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Solution 
@. The graph of f is shown below. 


YA 
Dy oo 
y = f(z) 
iA 
0 i r 


The function f is increasing on [0,2]. Thus, on each subinterval in 
[0, 2], the infimum of f is the value of f at the left endpoint of the 
subinterval and the supremum of f is the value of f at the right 
endpoint of the subinterval. Since the ith subinterval in Pən is 


eer $ =Í 


= eee 
m=? T m= s(4), a ee et 
w n n n n 


Now f(z) = x? on [0,1] and f(z) = 2 on (1,2] so, since zp = 1, we 
have 


2 
i=l . 
m= ( ) a tor — sD errata 


i 
|, we have tori = Nh nea 2m 
n 


n 
and 
its =2, Word = FF 2, WFP Boa aay 2 


Hence 
n+1 


OE) -Y mðn = BS 2 Mj OX; 


i=n+2 
z (i E) x + ((n—1) x2) x = 


eee l = 
£ (ei) 
n n n 


n(n + 1)(2n + 1) 2 
F : 
6n? n 
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Also, we have 


and 
M = 2 tor En n 2e 2m: 


Hence 


2n 
UGE Poy) — oe M;62; 
i=l 


n 2n 
= > M;ôxi + y MOT 
= 


i=n+1 


(ae oe T 
= (A ta 
n n 


= n(n + 1)(2n + 1) 
6n? 


We now look at the result of Worked Exercise F27 for different values of n. 
As n increases, the number, 2n, of subintervals in the partition increases 
and the length of each subinterval, 1/n, decreases. From the above 
formulas for L(f, Pən) and U(f, Pan), we find that, to three decimal places, 
the lower and upper Riemann sums are then as given in the following table. 


L(f, Pon) U(f, Pon) 
n=2: 4 equal subintervals 1.625 2.625 


n=: 8 equal subintervals 1.969 2.469 
n = 10: 20 equal subintervals 2.185 2.385 
n = 100: 200 equal subintervals 2.318 2.338 


As the subintervals increase in number and decrease in length, the lower 
sums increase and the upper sums decrease. But the lower sums are all less 
than or equal to all the upper sums! In fact this is always the case, as 
stated in the following result. 
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Theorem F44 


Let f be a bounded function on [a,b], and let P and P’ be partitions 
of [a,b]. Then 


L(f, P) <U(f, P’). 


You saw in Theorem F43 that, for a given partition P, 
L(f, P) < U(f, P). 


In order to prove the general case in Theorem F44 with two different 
partitions P and P’ we require some new ideas. We develop these ideas 
and prove this result in Subsection 1.4. 


The integral 


We now return to our original problem: how to define an ‘integral’ that 
pins down our intuitive notion of ‘the area under a curve’. We have seen 
that lower Riemann sums provide underestimates for this ‘area’ and upper 
Riemann sums provide overestimates; see Figure 7. 


YA 


YA 


y = f(z) 


> 
Zo Tı T2 T3 +++ Ln-1 Tn © 


Figure 7 Lower and upper Riemann sums 
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So we make the following definitions. 


Definitions 


Let f be a bounded function on a closed interval [a,b], let P be a 
partition of [a,b] and let L(f, P) and U(f, P), respectively, be the 
corresponding lower and upper Riemann sums. 


Then the lower integral of f on [a,b] is 


b 
E 
Ja 12 


and the upper integral of f on [a,b] is 


Fb 
f = inf U(f, P). 


We say that f is integrable on [a,b] if 


[ss fo 


and their common value is then called the integral of f on [a,b]. 


b b 
The integral is written as | f or i f(a) dx, and a and b are called 


the limits of integration. 


Remarks 


1. The common value of the lower and upper integrals of f (when it exists) 
is sometimes known as the Riemann integral of f, rather than simply as 
the integral of f. 


2. In the definitions, sup L(f, P) is the supremum of the lower Riemann 
P 


sums over all possible partitions P of the interval [a,b]. Similarly, 
inf U(f, P) is the infimum of the upper Riemann sums over all possible 


partitions P. 


3. For any bounded function f on an interval [a,b], it follows from the fact 
that all lower Riemann sums are less than or equal to all upper 
Riemann sums (Theorem F44), and from the above definitions, that the 


b b 
lower integral f f and the upper integral / f both exist, and that 
a a 


we always have 


[ts ft 


(We omit the proofs of these facts.) Note, however, that the lower and 
upper integrals take different values unless f is integrable. 
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We now return to the function f that we considered earlier in Worked 
Exercise F27. 


Worked Exercise F28 


Let 


2 
Prove that f is integrable on [0,2], and evaluate f f. 
0 


Solution 
®. It is helpful to sketch the graph. 


YA 

2 oeo 
y = f(x) 

la 


We have already seen in Worked Exercise F27 that if we take Pən to 
be the partition 


(Paea b-i 


of [0, 2], then 
n(n + 1)(2n + 1) 2 
and 
n(n + 1)(2n + 1) 
U(f, R) = 6n3 2 
Then, as n — oo, we have 
1+1 2+1 2 1 
P a e weer oes 
6 mo g 3 


so that, in particular, 


[ret 
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2 
®. This holds because l f is defined to be the supremum of the 
0 


lower Riemann sums of f over all possible partitions of [0, 2], so it 
must be greater than or equal to the particular lower Riemann 
sum L(f, Pan) for any value of n. ® 


Similarly, as n — oo, we have 


Ci a meee, eel 


so that 
T 
[rss 
0 3 
We have now shown that 
2 aD 
les i f< i ged 
37 Jo 0 3 
It follows that 
2 
a 
Jo 0 


2 
so f is integrable on [0,2] and a J= L 
0 


However, not all bounded functions defined on closed intervals are 
integrable! 


Worked Exercise F29 


Let f be the Dirichlet function on [0,1] defined by 


1, O<a<1, x rational 
f(x) = are 
0, O<a <1, z irrational. 


1 71 
Determine the values of f f and J f, and hence show that f is not 
JO 0 


integrable on [0, 1]. 
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Solution 


®. The graph of f is shown below. 


YA 
] Pcceccccccccccccccccccccccce e 
—Oeseseesesccscesssecescessse > a 


Let P= they a wie tole ewe). herero — 0; a7-— 1 be 
any partition of [0,1]. Then, on each subinterval [z;—1, xi] in P, we 
have 

file =O ail Mb il, nor 8 = 1,2 coon ft 


®. This is because every subinterval contains both rational and 
irrational points, by the density property of the real numbers (see 
Subsection 1.4 of Unit D1). @ 


So 


LF P) = = MjOX; = 5 (0 x O) = 0 
=l 


i= 
and 
COAR) = z Mi;ôzxi = z (1 x O) = 2 = il 
j=l j= g= 


since the sum of the lengths of all the subintervals is equal to the 
length of the interval [0, 1]. 


It follows that 
1 
f fasti.p)=0 
Jo P 
and 


71 
i f =inf U(f, P) =p 


1 71 
Then, since | TFs f f, we conclude that f is not integrable 
J0 0 


on [0,1]. 
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Exercise F36 
For the function 
f(z)=7xz, xe [01], 
and the standard partition of [0, 1] 
r= {otl "afb nen, 
n| |n n n 


determine the values of L(f, Pa) and U(f, Pa). Hence show whether f is 


1 
integrable on [0,1] and, if it is, determine the value of f f. 
0 


1.2 Criteria for integrability 


It would be tedious to have to go back to the definition of integrability 
whenever we wish to show that a given function is integrable on a closed 
interval. You will now meet a number of criteria that we can use to avoid 
this. 


In order to prove that a bounded function f is integrable on [a,b] directly 
from the definition of the integral, we need to look at sup L(f, P) and 
P 


inf U(f, P) over all partitions P of [a,b]. However, as you saw in Worked 


Exercise F28 and Exercise F36, in many situations it is sufficient to 
consider just one sequence of partitions (P,,) in order to establish 
integrability. The conditions under which this simplification holds are set 
out in the following result. (Recall that ||P,,||, the mesh of Pa, is the length 
of the longest subinterval of Pn.) 


Theorem F45 


Let f be a bounded function on [a,b]. If there is a sequence of 
partitions (P,,) of [a,b] such that ||P,,|| > 0 and 


lim L(f,P,) = lim U(f,P,) =A, where AER, 
noo n—> oo 


b 
then f is integrable on [a,b] and / j = A 


Proof Let £ > 0. It follows from the equations in the statement of the 
theorem that there exists an integer n such that 


L(f,Pa)> A— łe and U(f, Pa) < A+ 4e. (1) 


Now, by the definitions of upper and lower integrals, 


b Fb 
f EIHP, and Ee (2) 
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Combining inequalities (1) and (2), we obtain 
b Fb 
A- łe< f f< f f<A+3e. 


®. Here we have used the inequality [ Ts [ f mentioned in the 


remarks after the definition of the integral in the previous subsection. As 
stated there, this follows from Theorem F44. .@& 


Since £ is any positive number, we deduce that the upper and lower 
integrals of f on [a,b] are equal to A, so f is integrable and 


[tra a 


In fact the following result in the opposite direction to Theorem F45 also 
holds. However, its proof is somewhat more complicated, and we defer this 
to Subsection 1.4. 


Theorem F46 


If f is an integrable function on [a,b] and (Pp) is a sequence of 
partitions of [a,b] such that ||P,|| > 0, then 


dim Lf, Pa) = im U P a Jo 


This result is particularly useful for proving that a function is not 
integrable. For if f is defined on [a,b] and we can find a sequence of 
partitions (P,,) of [a,b] whose mesh tends to zero but for which 


jim Lf, Pa) # lim U(f, Pa), 


then it follows from Theorem F46 that f is not integrable on [a,b]. 


Exercise F37 


For each of the following functions f, determine whether f is integrable 
1 


on [0,1] and, if it is, find f f. 
0 


=). 0<z<1, 
a t)= 
(a) Fa) {3 A 
b) f(x) zx, O<a<1, zx rational, 
4 64 = 
0, O<a<1, 7 irrational. 
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The next result is of particular interest in that its statement says nothing 
about the value of the integral itself: it mentions only the difference 


between the lower and the upper Riemann sums. The result follows from 
Theorems F45 and F46. 


Corollary F47 Riemann’s Criterion 
Let f be bounded on [a,b]. Then 

f is integrable on [a, b] 
if and only if 


there is a sequence (P,,) of partitions of [a,b] with ||P,|| + 0 
such that U(f, Pa) — L(f, Pn) > 0. 


Proof Theorem F46 implies that if f is integrable on [a,b], and (P,) is a 
sequence of partitions of [a,b] with ||P,,|| > 0, then 


On the other hand, if there is a sequence (P,,) of partitions of [a,b] with 
||P, || + 0 such that U(f, P,) — L(f, Pa) > 0, then, because 


L(f, Pa) ays < fi < U(f, Pa), 


these upper and lower integrals must be equal, so f is integrable on [a,b], 
by Theorem F45. E 


Two classes of integrable functions 


With Riemann’s Criterion at our disposal, we can now determine some 
large classes of functions that are always integrable: the monotonic 
functions and the continuous functions. 


Theorem F48 


A function f which is bounded and monotonic on fa, b] is integrable 
on [a,b]. 


Proof We prove this theorem in the case when f is increasing on fa, b]. 
(The proof is similar if f is decreasing.) 


Consider the standard partition of fa, b]; that is, 


Pr = {[z0, £1], [mis x2], see tetas (3) 
where 
.b—a 
zti=za+i , fori=0,1,2,...,n. (4) 
n 


1 The Riemann integral 


Now f is increasing, so on each subinterval [x;_1, x;], for i = 1,2,...,n, 
YA 
mMi = fltizi) and Mi = EA 
see Figure 8. Also, 62; = (b — a)/n, for i = 1,2,...,n. Hence y= F(z) 
i=1 
b— a n = 
T 2 e= fle) to=@ zı XL = b= @p T 
b—a 7 Figure 8 ‘The standard 
a (f(@n) — f(0)) partition for an increasing 
b-a function f 
==" (F(0)— F(a)) 


®. Here we have used the fact that the terms of the series are 


fzr) — fleo), f(v2)— f(t1), =, Flan) — f(@n-1), 
so everything except f(zo) and f(x,) cancels. ©& 
= 


The sequence ((b — a)(f(b) — f(a))/n) is null, so it follows from 
Riemann’s Criterion that f is integrable on [a,b]. a 


Theorem F49 


A function f which is continuous on [a,b] is integrable on Ja, b]. 


Proof We use the fact that f must be uniformly continuous on [a,b]. 


®. You saw in Theorem F19 in Subsection 4.2 of Unit F1 that a function 
which is continuous on a bounded closed interval is uniformly continuous 


there. & 
Let £ > 0. Then there exists 6 > 0 such that 
If(x) - fy) < — for all x,y € [a,b] with |x — y| < ô. (5) 


®. We use ¢/(b— a) here in order to obtain £ later in the proof. & 


Next we choose N € N such that (b—a)/N < ô. For n> N, let P, be the 
standard partition of [a,b] given by equations (3) and (4). 


Now f is continuous on each subinterval [x;_1, xi], for i = 1,2,...,n. Thus, 
by the Extreme Value Theorem (see Subsection 3.3 of Unit D4), there exist 
points c; and d; in [x;-1, zi] such that 

mi = fle) and Mi= f(di;). (6) 
Since [x;-1, xi] has length (b — a)/n < ô, we deduce by statements (5) 
and (6) that 


& 
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= 
®. Here we have used the fact that |d; — ci| < es , from which it 
n 


follows that |f (d;i) — f(c:)| = Mi — mi; < — by statement (5). ® 
—a 


Hence, for n > N we have 


Thus (U(f, Pn) — L(f, Pn)) is a null sequence, so it follows from 
Riemann’s Criterion that f is integrable on [a,b]. E 


Theorems F48 and F49 show that monotonic functions and continuous 
functions are integrable, but we know that some bounded functions are not 
integrable: for example, the Dirichlet function, as you saw in Worked 
Exercise F29. This suggests the question: precisely which bounded 
functions are integrable? 


The full answer to this question is rather complicated but, roughly 
speaking, a bounded function is integrable on [a,b] if and only if it is 
continuous at ‘most’ points of [a,b]. However, it is possible for a function 
to be discontinuous at infinitely many points of [a,b] and yet be integrable 
on [a,b]. For example, the Riemann function which you met in Section 3 of 
Unit F1 is discontinuous at all rational points and yet it can be shown to 
be integrable on [0,1], the value of its integral being 0 (we do not prove 
this here). 


Riemann and Lebesgue integration 


Georg Friedrich Bernhard Riemann (1826-1866) laid down the 
fundamental ideas of the integral that is now named after him when 
he was writing his doctoral thesis in 1854. This was published 
posthumously in 1867. His formulation and proof were rather obscure 
and the version that is generally used today (including in this module) 
was given by the French mathematician Gaston Darboux (1842-1917) 
in 1875. 


In 1902 a different definition of the integral was given by another 
French mathematician, Henri Léon Lebesgue (1875-1941). The main 
difference between the two definitions is in the way the area under the 
curve is measured. The Riemann integral considers the area as being 
made up of vertical rectangles, while the Lebesgue integral considers 
horizontal rectangles. Or to put it another way, the Riemann integral 
considers the domain of the function while the Lebesgue integral 
considers the codomain of the function. Although Lebesgue’s 
definition has the advantage that it is applicable to a larger class of 
functions than Riemann’s definition, it requires the formal notion of a 
measure. (You may study measure theory in the future, but this topic 
is not covered in M208.) 


1 The Riemann integral 


Lebesgue himself provided a rather nice example to illustrate the 
difference between his approach and that of Riemann: 


I have to pay a certain sum, which I have collected in my pocket. 
I take the bills and coins out of my pocket and give them to the 
creditor in the order I find them until I have reached the total 
sum. This is the Riemann integral. But I can proceed differently. 
After I have taken out all my money I order the bills and coins 
according to identical values and then I pay the several heaps 
one after another to the creditor. This is my integral. 


(Source: Siegmund-Schultze, R. (2008) ‘Henri Lebesgue’, in Gower, T. (ed) 
The Princeton Companion to Mathematics, Princeton, Princeton University 
Press, p. 796.) 


Henri Léon Lebesgue 


1.3 Properties of integrals 


In your previous study of integration (for example, in a calculus course) 
you will have met many properties of integrals without a clear explanation 
of exactly why they hold. We now look at several of these properties and 
in some cases give an outline of how they follow from the definition of an 
integral in Subsection 1.1 and the various theorems that you met in 
Subsection 1.2. In reading this subsection, you should concentrate on 
understanding the various properties themselves; our comments on why 
the properties hold are optional reading in case you are interested. 


First, we look at the limits of the integral. We have already defined 
b 


integrals of the form f, where a < b; we now look at the situation 


a 
where a = b or a > b. 


Definitions 


Let f be a bounded function that is integrable on an interval I 
containing a and b, where a < b. Then we make the following 
definitions. 


° T= 0 
a b 
ef t=- 
b a 
0 1 
Thus, for example, f x dx is defined to equal — f x dx; you have already 
1 0 


1 0 
seen in Exercise F36 that | zdr = 5, so we define the value of f x dgx to 
0 1 


1 
be 3: 
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Next we look at the integrability of a bounded function on intervals with 
endpoints a, b and c, irrespective of the order of these endpoints on the 
z-axis. Figure 9 illustrates two possibilities: a < c < b and a < b < c. The 
following result applies whatever the order of a, b and c. 


YA YA 
y = f(x) y = f(z) 


(a) (b) 


Figure 9 Integrating a function on intervals with endpoints a, b and c 
(a) when a < c < b and (b) whena<b<c 


Theorem F50 Additivity of integrals 


Let f be a bounded function that is integrable on an interval I 
containing the points a, b and c. Then 


Pe E 2 
ey 


This result can be proved directly from the definitions above, and the 
definition of the integral given in Subsection 1.1. Notice that, in the 


b 
situation illustrated in Figure 9(b), I f is negative since b < c. 
C 


Our next result says that the integral of a non-negative function is 
non-negative, and the integral of a non-positive function is non-positive; 
see Figure 10. This result can also be proved directly from the definition of 
the integral. 
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(a) (b) 


Figure 10 The sign of the integral for (a) a non-negative function and (b) a 
non-positive function 


Theorem F51 Sign of an integral 
Let f be a bounded function that is integrable on [a,b]. 


b 
e If f(x) >0 on [a,b], then f Jf 20: 


b 
o If f(x) <0 on {a, th then f Fao 


Of course, a function can be non-negative on some parts of its domain and 
non-positive on other parts. In these circumstances, subintervals where the 
function takes negative values make a negative contribution to the total 
area between the graph of the function and the x-axis. If the interval [a,b] 
on which a function is defined is made up of finitely many subintervals 
where the function is either always i a or always non-positive, 


then we can evaluate the integral J f by applying Theorem F50. If there 
a 
are infinitely many such subintervals, then the evaluation of the integral 


involves summing an infinite series; we do not pursue this here. 


Next we note that if a function f is integrable, then so is |f|; see Figure 11. 


YA YA 


(a) (b) 
Figure 11 (a) The integral of f (b) The integral of |f| 
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Theorem F52 Modulus Rule 
If f is integrable on [a,b], then |f] is also integrable on Ja, b]. 


Outline of the proof (optional) 
For a bounded function f on [a,b] and a partition of [a,b] 
PS {[xo, £1], [£1, £2], shine [Zn—-1, Ln] }, 


we define, for i = 1,2,...,n, the variation w;(f) of f over the subinterval 
[xi—1, xi] to be 


wi(f) = sup{| f(x) — f(y]: x,y € [vi~ zil}. 
It can be shown that 

wi(f) = sup{ f(x) : x € [aj_-1, xi] } — inf{ f(a) : x € [x;-1, zil}. 
(We omit the details here.) 


Hence 
U(f,P) — L(f,P) = >) wil f)d2i, (7) 
=l 


where 6x; = £i — £i—1, as usual. 


Now, by the backwards form of the Triangle Inequality which you met in 
Subsection 3.1 of Unit D1, we have 


IFE IEIS F) F), for x,y € [xi-1, xi], 
so that 
wi(|f]) <wi(f), fori=1,2,...,n. 
Hence, by equation (7), 
U(|fl, P)— L(F|, P) < U(f, P)- L(f, P), (8) 


for any partition P of [a,b]. We can now use Riemann’s Criterion to 
deduce from inequality (8) that if f is integrable on [a,b], then so is |f|. E 


Finally, we set out the Combination Rules for integrable functions; we use 
these a great deal to construct ‘new integrable functions from old’. 
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Theorem F53 Combination Rules for integrable functions 
If f and g are integrable on [a,b], then so are the following functions. 


Sum Rule f +g, with integral 


furo=fs+ fs 


Multiple Rule Af, for À € R, with integral 


m 


Product Rule fg 
Quotient Rule f/g, provided that 1/g is bounded on [a,b]. 


You will meet some techniques for finding the integrals of products and 
quotients in the next section. 


The Combination Rules can be proved using the same approach as for the 
Modulus Rule. The proof uses the following inequalities, which relate the 
variations of the new functions over a subinterval [xz;—1, £;i] of a partition to 
those of the known integrable functions f and g: 


e wi(f +g) < wil f) + wilg) 
e wi(Af) < |Alwi(f), for AER 
e wi(fg) < M(wi(f) + wi(g)), where M = max{sup |f], sup |g|}- 


1.4 Proofs of Theorem F44 and 
Theorem F46 (optional) 


In the proofs of these results, we will use the notion of a refinement of a 
partition. If P is a partition of an interval [a,b], then any partition 
obtained from P by adding to it a finite number of partition points is 
called a refinement of P. The partition of [a,b] obtained from two 
partitions P and P’ of [a,b] by using all their partition points is called the 
common refinement of P and P’. 
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For example, for the partitions 
P= {[0, 3l [4-3], [2.4] [4.13 

and 
P' = { [0,5]; [5 3) [5:1] 

of [0, 1], the common refinement of P and P’ is the partition 
{0 4], [a 3h [ah a 3b 5h t 


as illustrated in Figure 12. 


10 il 1 13 i. > 
4 2 4 
1 12 / 
0 13 13 1 
| 
y 7 y id y 
| | 
| l 
j ' — / common 
0 L A 1 2 3 1 
4 3 2 3 4 refinement 


Figure 12 The common refinement of two partitions of [0, 1] 


We first prove Theorem F44, which is restated below. 


Theorem F44 


Let f be a bounded function on [a,b], and let P and P’ be partitions 


of [a,b]. Then 
IP P) £ UP): 


Proof In our proof we will assume that f is non-negative on [a,b]. The 
general result can be deduced by applying the ‘non-negative version’ of the 
result to the function g = f + c, where c is a constant so large that g is 
non-negative on [a,b]; such a constant c exists since f is bounded on Ja, b]. 


First recall that, from Theorem F43, for any partition P of [a,b] we have 


L(f,P) <U(f, P). 


We will prove Theorem F44 by showing that if P” is the common 


refinement of P and P’, then 
LP PyalG Pag?) SUP]: 
We claim that adding a new partition point 2’ to a partition 


P= {[z0, z1], [£1, £2], sey =e En] } 
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of [a,b] does not increase the upper Riemann sum and may decrease it. 
This is because the new point 2’ lies in a subinterval [x;_1, xi], for some i, 
and the only effect on the upper Riemann sum of adding 2’ is to replace 
the rectangle with side M; standing on [z;—1, z;] with a pair of adjacent 
rectangles standing on [x;_1,x;] with heights at most M;, as shown in 
Figure 13. 


Ti—1 Ti Ti—1 x Ti 


Figure 13 The effect on the upper Riemann sum of adding a new partition 
point 


Hence adding a finite number of new partition points to P does not 
increase the upper Riemann sum. Similarly, adding a finite number of new 
partition points to P does not decrease the lower Riemann sum. 


Now the partition P” can be formed from either P or P’ by adding a finite 
number of partition points, so inequalities (9) follow. E 


To end this subsection we give the proof of Theorem F46. 


Theorem F46 


If f is an integrable function on [a,b] and (Pp) is a sequence of 
partitions of [a,b] such that ||P,|| > 0, then 


Mer URR) = im Ufa Po B Í. 


n—> o0 


b 
Proof We prove that im Uf Pr = / f. The proof that 


Jim, LAS Ps y= ff f is similar. 


We assume that f is non-negative on [a,b] so, for some M € R, we have 
0< f(x) <M, fora<2¢< bd. (10) 


Let € > 0. Since f is integrable on [a,b], there is a partition 


P= {{29, 1], [z1 £9], e.’ E E 


of [a,b] where x) = a and z}, = b, with m subintervals, such that 
b 
UEP) < f Fte (11) 


®. We use Je here in order to obtain € later in the proof. ® 
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Figure 14 Removing a point 
x; of a partition 
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Now consider any partition in the sequence (P,,), of the form 
Pe = {ip Bil ertha Sol}, 

where p is the number of subintervals in Pa. For k = 1,2,...,p, we define 
My = sup{ f (£) : £k-1 < £ < £k} and 62, = Ek — Tk—1- 


Let P’ denote the common refinement of P’ and Pp. Then, as in the proof 
of Theorem F44, we have 


UG.) < UCF, P’). (12) 
Now we can obtain P, from P’ by removing at most m — 1 of the partition 
points of P’. 


®. This is because P’ has m + 1 partition points, and all partitions have 
the points a and b in common. ® 


Removing such a point x/, lying in (£k—1, £k) say, we increase the upper 
Riemann sum by at most Mp(£k — £k—1), as illustrated in Figure 14. So, 
since M;, < M, by inequality (10), and £k — £k—1 < ||Prl|, for 

k=l 2.20250; 


U(f, Pa) < U(f, Pa) + (m—1)M||Pall. (13) 
Combining inequalities (11), (12) and (13), we obtain 


b 
(SPa) < ( | F+ $e) + 0n- DMIPAN 
a 
Since ||P,,|| > 0 as n > œœ, we can choose N so large that 
(m—1)M||Pal| < $£, for alln >N. 


®. We use Zé here in order to obtain € in the next step of the proof. ® 


Hence 


b b 
Ul Lee (/ f+4e] +łe= | fte, fraln>N. 
b 
Since U(f, P,) > f f, by the definition of the integral, we deduce that 


b b 
vue- f f =U P) - f aa hainesi 


a 


b 
®. So U(f, Pn) — f f tends to zero as n tends to oo. .& 


b 
Hence lim U(f, Pa) = / f, as required. Oo 
n—> o0 a 


2 Evaluation of integrals 


In Section 1 you saw what it means for a bounded function defined on a 
closed interval to be integrable. In this section you will study a variety of 
techniques for evaluating the integrals of such functions. 


2.1 The Fundamental Theorem of Calculus 


In this subsection we show that integration and differentiation are 
intimately related by proving a result known as the Fundamental Theorem 
of Calculus. You probably know this result from a previous course on 
calculus, but it is worth pausing to reflect on how remarkable it is. In 
Section 1 we defined the integral by means of more and more accurate 
estimates of the area between the graph of a function and the x-axis. At 
no point did it seem that this process was related to differentiation, but 
the Fundamental Theorem of Calculus shows that integration and 
differentiation are in some sense inverse processes. As you will see, this 
fact is enormously helpful in evaluating integrals. 


We begin our exploration of these ideas by defining a primitive of a 
function. 


Definition 
Let f be a function defined on an interval J. Then a function F is a 
primitive of f on I if F is differentiable on J and 


PQ) — ja), termes. 


It follows from this definition that finding a primitive is the inverse of 
finding a derivative. (For this reason, a primitive is sometimes called an 
antiderivative.) Note that the domain of the primitive F may be larger 
than the interval J on which f is defined. 


As an example of finding a primitive, let 
f(x) = tanz. 

Then the function 
F(x) = log(sec x) 


is a primitive of f on the interval (—7r/2, 7/2), since 


1 
F'(x) = = secgtanz = tang, for x € (—a/2,7/2). 
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Exercise F38 
(a) Let 
f(a) = (z? — 4)? (w € (2, 00)). 
Prove that 
F(a) = log (z + (2? — 4)"2) 


is a primitive of f on (2,00). 


(b) Let 
f(z) = sech (= È ), 


cosh x 


Prove that 
F(x) = tan7*(sinh z) 


is a primitive of f on R. 


We now state and prove our main result, the Fundamental Theorem of 
Calculus. The theorem tells us that we can evaluate the integral of a 
function f on an interval [a,b] by finding a primitive F of f on [a,b]. Note 
that the expression F(b) — F(a) is sometimes written as [F(«)|° or F(2)|°. 
Theorem F54 Fundamental Theorem of Calculus 


Let f be integrable on [a,b] and let F be a primitive of f on [a,b]. 
Then 


b 
f f=FO-FO). 


Proof Let 

Pa = {[£0; £1], 3005 [£i-1; til -s Eaten be for n= 1,2... 
be a sequence of partitions of [a,b], with zo = a, £n = b and ||P,|| > 0. 
On each subinterval 

tiiti fori =i; eeii 


the function F satisfies the conditions of the Mean Value Theorem which 
you met in Subsection 4.1 of Unit F2 Differentiation, since a primitive is 
differentiable and hence continuous. Thus there exists a point 
Ci E€ (£i—1, zi) such that 
F (ai) — F(xi-1) = F' (ci) (xi — ti—1) 
= f(ci) xi, (14) 


where 62; = £i — £i—1. 


@. Now recall that we use the notation m; = inf{f(x) : x € [z;_-1, xi]} and 
M; = sup{ f(z) : z € [ai-1,0;]}. © 


Since 
mis Te) My tora =— 1,2, eT; 


it follows that 
i=1 i=1 i=1 


Using equation (14), we can rewrite this statement as 
n 
Les) a a aU). 
i=1 


®. We now use telescopic cancellation to evaluate the series in these 
inequalities, which is 

(F(a1) — F(20)) + (a2) — F(21)) +++ + (F(tn) - F(@n-1)). 
The only remaining terms after cancellation are F (xn) and —F (zo). ® 
The series has sum F (zn) — F(xo) = F(b) — F(a), so 

L(f, Pn) < F(b) — F(a) < UCF, Pn). (15) 


Since f is integrable on [a,b], the sequences (L(f, Pa)) and (U(f, Pn)) both 
b 
converge to f f, by Theorem F46. It follows from inequalities (15) and 


the Limit Inequality Rule for sequences (Theorem D11 in Unit D2 
Sequences) that 


[rsr@-re@s fs, 


which gives the required result. E 


b 
Theorem F54 shows the close relationship between the integral f f and 
a 


any primitive F of f on [a,b]. Because of this result, a primitive F is also 
called an indefinite integral of f and denoted by f f(x)dx. Moreover, 


the process of finding a primitive of f is often informally called 
integrating f, and in this context the function f is called an integrand. 


b 
Also, the integral f f is often referred to as the definite integral of f 
over [a,b]. g 


We can use Theorem F54 and the table of standard primitives at the end 
of this unit to evaluate many integrals. 
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Worked Exercise F30 


1 
Evaluate J 2” dr. 
0 


Exercise F39 


Using the Fundamental Theorem of Calculus and the table of standard 
primitives, evaluate the following integrals. 


(a) [Eta (b) f oszd 


2.2 Primitives 


It is natural to ask: can a function have more than one primitive on an 
interval? The answer to this question is ‘yes’: for example, on (—1, 1) the 
functions 


rm r? and ze r? +1 
are both primitives of the function 
GH 2r. 


However, any two primitives of a function f on an interval can differ only 
by a constant. 


Theorem F55 Uniqueness Theorem for Primitives 


Let Fy and F be primitives of f on an interval J. Then there exists 
some constant c such that 


F(x) = Fi(z) +c, forge. 


Proof Since F} and F, are primitives of f on J, 
Fi(z)= f(x) and F}(x)= f(x), forz el, 
so 
F}(x)— Fi(z) =0, forz eI. 


Thus, by the Zero Derivative Theorem (Corollary F39 in Unit F2), there 
exists a constant c such that 


Foə(x)— Fi(x)=c, forrel. a 


The range of primitives we can find is considerably extended by the use of 
several Combination Rules. These rules can be proved using the 
corresponding rules for derivatives; we omit the details. 


Theorem F56 Combination Rules for primitives 


Let F and G be primitives of f and g, respectively, on an interval J, 
and let A € R. Then, on I: 


Sum Rule f +g has a primitive F + G 
Multiple Rule Af has a primitive AF 
Scaling Rule x> f(z) has a primitive z +> Z F(a), 
tone O 


For example, it follows from the table of standard primitives and the 
Combination Rules that the function with domain R* and rule 


at 3x7! + sinh 2x 
has a primitive 
x e 3log x + 5 cosh 2x. 


In applications of these Combination Rules we do not usually mention the 
rules explicitly. 


Exercise F40 


Using the table of standard primitives and the Combination Rules, find a 
primitive of each of the following functions. 


(a) f(x) =4logz —2/(4+ 27) (x € (0,0)) 
(b) f(z) =2tan3z+e"cosx (x € (—1/6,7/6)) 
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2.3 Techniques of integration 


The Fundamental Theorem of Calculus provides a powerful method for 
evaluating certain integrals. However, even when we know that a function 
f has a primitive F, it may not be possible to determine F explicitly. 


In fact, most functions f, even quite simple ones, have primitives which are 
not standard functions. For example, the primitives 


ie dx 


cannot be expressed as a combination of a finite number of rational, nth 
root, trigonometric, exponential and logarithmic functions. (Note that 


dx 
here we have used a standard shorthand and written i —— instead of 


i (log x)? 
J Toga? 7?) 


However, by using the Combination Rules we can certainly integrate any 
polynomial function, and the primitive is then always another polynomial 
function. For example, 


(2? 045) de = 4a? — $a? + 5a. 


There is also is a standard procedure called the method of partial fractions 
for integrating a rational function. This procedure is not used in this 
module but is often used for evaluating the integrals of rational functions in 
complex analysis. Using this method, it can be shown that any primitive of 
a rational function can always be expressed in terms of rational functions, 
logarithms of rational functions and inverse tangents of linear functions. 


There are also various techniques which can be applied to certain other 
types of function; the art of integration lies in recognising these types. We 
now describe briefly some basic techniques of integration. 


Integration by substitution 


We describe two related techniques of integration by substitution. The 
first is used when the integrand is of the form 


xı—> f(9(x))g'(2). 
In this case, if F is a primitive of f, then 


d 


Tat a2) = F'(g(2))9' (2) = F(g(z))9"(@), 


by the Composition Rule for derivatives (also called the Chain Rule). Thus 
x > F(g(x)) is a primitive of  —> f(g(x))g’(x) on any interval in the 
domain of F o g. So if we substitute u = g(x), then we obtain a simpler 
integrand since 


ECOG dx = F(g(x)) = F(u) = fro du. (16) 


This technique is worth trying if you can express the integrand in the form 
f(g(a))g'(x), for some functions f and g, as illustrated in the next worked 
exercise. 


Worked Exercise F31 


Find a primitive of the function 


cr-—>a*(e2+1)8 (2 ER). 


Solution 
®. Differentiating x? + 1 gives 3x7, which suggests that we try putting 
u=g(z)=2°+1. # 
d 
Put u = 9(z) = 2° +1. Then = = gf e) = 30,60 U = 3a da 
E 


®. Now substitute to express the integral in terms of u, adjusting by 
a multiplicative constant as necessary. „® 


We have 


ae + de = if + 1)° 327 dx 


yfe du. 


@. Next, evaluate the integral in terms of u. ® 


iTO, 
x gu 


®. Finally, substitute back for u to obtain the result in terms of x. ® 
TEDE 


Thus +(x° + 1)° is a primitive of a (e + 1)8. 
27 


®. You can always check your result by differentiating. ® 


The following strategy summarises the approach taken in Worked 
Exercise F31. 
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Strategy F8 


To find a primitive | f(g(x))g'(x) dx using integration by 


substitution, do the following. 


d 
1. Choose u = g(x). Find = = g'(x) and hence express du in terms 
i 


of x and dz. 
2. Substitute u = g(x) and replace g/(x) dx by du (adjusting constants 


if necessary) to give | f(u) du. 


3. Find fro du. 


4. Substitute u = g(x) to give the required primitive in terms of x. 


Remarks 


1. If we are evaluating an integral, rather than finding a primitive, then 


there is no need to perform step 4 of Strategy F8. Instead, we can 
change the z-limits of integration into the corresponding u-limits: 


b g(b) 
f CEOE / Hida 
a g(a) 


For example, in Worked Exercise F31 we have 
u= g(x) = 2° +1, 
so 
whenx=0, u=1, andwhenz=1, u=2. 


Hence 


1 2 
: r? (x3 + 1)8 dx = t f u® du. 
0 1 


. When applying equation (16), you may be able to spot a primitive F 


of f immediately, in which case you can write down the required 
primitive F(g(x)) directly without need for a substitution. For example, 
if you were able to spot the primitive in Worked Exercise F31, you 
could just write 


foe +1)§ dz = fe + 1)? 8a? dz 


l 
3 
ix F(x’ +1) = #(x° +1)°. 


One particular situation in which this simplification can often be 
applied is with an integrand of the form g'(x)/g(x) on an interval J, 
since we have 


[2 dz = log(g(x)), if g(x) >0, forzre J, (17) 
g(x) 
because < log(g(x)) = aa x g'(x): 


An example involving this useful formula appears in the next exercise. 


Exercise F41 


Find a primitive of each of the following functions. 
(a) f(x) =sin(sin3z)cos3x (a €R) 

(b) f(x) =27(2+323)" (ER) 

(c) f(z) =asin(2x”) («€R) 

(da) f(z) =2/(2+327) (« €R) 


T 


Exercise F42 


Evaluate the integral 
1 x 
e 
— d. 
J +e” 


Our second substitution technique is a modification of the above method, 
which we call backwards substitution. It is based on the formula 


f teae= f EDK du, 


obtained from equation (16) by swapping the variables x and u. Here h is 
a function such that x = h(u), often found by first writing u = g(x) 
where g has an inverse function g~! = h. Backwards substitution is worth 
trying if it makes part of the integrand significantly simpler. The next 
worked exercise gives an example of the use of this technique. 
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Worked Exercise F32 


Find a primitive of the function 


e2t 


opie CE (0.00)). 


Ty 


@®. If u = g(x) = (e? — 1)'/?, then g has an inverse function 
x = h(u) = log(u? + 1). # 


Put u = g(x) = (e? — 1)!/2. Then the inverse function is 


d 
= a = log(u? + 1), so = = pu) = z = i and therefore 
u 
®. Now substitute to express the integral in terms of u. ® 
We have 


e27 elog(u? +1)? 2u 
Se d 
Veep a 


4 ae 2 
- [H z P 
u uł + 1 


= fe +1) du 


@. Next, evaluate the integral in terms of u. ® 


= žu” ap Al 


®. Finally, substitute back for u to obtain the result in terms of x. && 


= $(e" — 1)9/? + (e — 1)”. 


e2t 


Tus (e = 1) 2(e2 = EA asa primitive at, (@ 12" 
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The following strategy summarises the approach taken in Worked 
Exercise F32. 


Strategy F9 


To find a primitive J f(x) dx using integration by backwards 
substitution, do the following. 
1. Choose u = g(x), where g has an inverse function x = h(u). Find 


as 
a h'(u) and hence express dx in terms of u and du. 
u 


2. Substitute x = h(u) and replace dx by h’(u) du to give a primitive 
in terms of u. 


3. Find this primitive. 


4. Substitute u = g(x) to give the required primitive in terms of z. 


As before, if we are evaluating an integral, then instead of step 4 in 
Strategy F9, we can change the z-limits of integration into the 
corresponding u-limits: 


b (b) 
f ieas i ~ O a 
a g(a) 


For example, in Worked Exercise F32 we have 


u=g(z) = (e*- 1)", 


so 
when z =log2, u=1, and when zx =log3, u= v2. 
Hence 
log 3 e27 v2 2 
mt] 2(u“ + 1) du. 
J (e7 Z DA 1 


Exercise F43 
(a) Find a primitive of the function 


f(x) = : 


32-13 +2(2—)? (x € (1,00)), 


using the substitution u = (x — 1)'/?. 


(b) Evaluate the integral 


log 3 
f e” y1 + e dz. 
0 
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Integration by parts 


The technique of integration by parts is derived from the Product Rule for 
differentiation, 


(fo = F'g + Fg', 
which implies that 


fid = to- | Fo so [setae - f ta 


This formula converts the problem of finding a primitive of fg’ into the 
problem of finding a primitive of f'g. Integration by parts is worth trying 
if you can express the integrand as a product of two functions, f(x)g'(x), 
where f(x) becomes simpler on differentiation, and g'(x) becomes not 
much more complicated on integration. 


Here is a strategy for using integration by parts. 


Strategy F10 


To find a primitive | k(x) dx using integration by parts, do the 
following. 


1. Write the original function k in the form fg’, where f is a function 
that you can differentiate and g’ is a function that you can 
integrate. 


2. Use the formula fis = fo- |ts. 


The next two worked exercises give examples of using this technique. 


Worked Exercise F33 


Find 


fe cos x dz. 


Sometimes we have to multiply the integrand by the factor 1 in order to be 
able to apply integration by parts, as in the following example. 


Worked Exercise F34 


Evaluate the integral 


1 
J tan! zdz. 
0 


Exercise F44 


(a) Find a primitive of the function 
k(x) =a loge (a € Rt). 
(b) Evaluate the integral 


T/2 
2 
| x cos x dx. 
0 


Hint: Use integration by parts twice. 
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Sometimes we need to evaluate an integral [,, that involves a non-negative 
integer n. A common approach to such integrals is to relate the value of In 
to the value of In—1 or In_2 by a reduction formula (sometimes called a 
recurrence relation) using integration by parts. Here is an example that 
will be important later in the unit. 


Worked Exercise F35 
Let 
a /2 
r= sin” gdr, n=0,1,2,.... 
0 


(a) Evaluate Jp and A. 
(b) Prove that 


Ip ( — *) In-2, forn> 2. 
n 
(c) Deduce the values of I>, I3, I4 and Is. 
Solution 
an /2 
(a) We have w= Lan = m2 emal 
0 
T/2 x/2 
f=) Ssmsd = [= csn] = 

0 

(b) We write 


a /2 
Iy = i sin x sin”! «dz. 
0 


@. We integrate sin x and differentiate sin” 12. © 


Using integration by parts we find that, for n > 2, 


In = [(— cos 2) sin"? a] ue — 


T/2 
| (— cos x)(n — 1) sin"? z cos x dx 
0 
a /2 
=0+(n-1) | cos? z sin”? z dx 
0 


a /2 
= (n— 1) | (1 — sin? z)sin™ ? zdr 
0 


T2 T2 
= (n— 1) sin”? z dx -f sin” x dx 
0 0 


= (n—1)Un-2 — In). 
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By repeatedly applying the reduction formula in Worked Exercise F35, we 
obtain the general formulas 


| ae eae 

MDTA G In ě 2 
and 

E. o A eee _2n 

enpi g S T7 mL 


We will use these formulas in Subsection 3.2. 


Exercise F45 
Let 
1 
de =| CL dr, awa eek 
0 
(a) Evaluate Jp. 
(b) Prove that 
In =e-—nIn1, forn=1,2,.... 


(c) Deduce the values of I, I2, Ig and T4. 


3 Inequalities, sequences and series 


Often it is not possible to evaluate an integral explicitly, and a numerical 
estimate for its value is sufficient. This situation can arise both in 
applications of mathematics and in proofs that involve integration. In this 
section we study some inequalities satisfied by integrals, and apply these to 
find two remarkable formulas for 7, and to decide whether certain series 
are convergent or divergent. 
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The basic inequality rules for integrals are as follows. 


Theorem F57 Inequality Rules 
Let f and g be integrable on Ja, b]. 
(a) If f(x) < g(x), for x € [a,b], then 


b b 
oa 
(b) Ifm < f(z) <M, for x € [a,b], then 
b 
mo-a) < | o ey. 


Proof The proofs of parts (a) and (b) of Theorem F57 are illustrated in 
Figure 15. 


YA 


(a) 


Figure 15 The proofs of the Inequality Rules parts (a) and (b) 
(a) Let P be any partition of [a,b]. Since 
f(x) < glx), for x€ [a,b], 


the infimum of f on each subinterval of P is less than or equal to the 
infimum of g on that subinterval, and therefore L(f, P) < L(g, P). 
Thus 


b b 
/ fail. P)< mp l@,P)= | a 
a P P a 


since f and g are both integrable on [a, }]. 


(b) Since f(x) < M for x € [a, b], it follows from part (a), with g(x) = M, 
that 


f t< f mi=me-a) 


The proof of the left-hand inequality is similar. E 
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3 Inequalities, sequences and series 


The Inequality Rules allow us to estimate a complicated integral by 
evaluating a simpler one, as in the next worked exercise. 


Worked Exercise F36 


Prove the following inequalities. 


1 x 3 
——___ dr< ! log2 b =<] — 
(a) I 2— sinf r "SAE (b) v34 J-1V¥24+ 2° 


2 
dx <3 


Solution 


(a) 


Since 

|sin z| < |z|, forx ER, 
®. This is Corollary D46 from Subsection 2.3 of Unit D4. & 
it follows that 


sinf x < r$, for x eR. 


Thus 2 ~ sin q > 2-—2*>0 for x € [0,1], so 


z? 


— — fi Ome 
S=sn 2 2— 7°" ee Ue 


Hence, by Inequality Rule (a), we have 


1 3 1 3 
dg x 
o 2 = ginal ge 0 2=e 


1 
= [—4 log(2 — ae 
= —}(log 1 — log 2) 
= ; log De 


@. Here we have used equation (17) with g(x) = 2 — zt. ® 
Since the function x —> v2 + 2° is increasing on [—1, 2], we have 


1 < V¥2+2° < V34, for x € [-1,2], 
so 
1 il 
= 3 SS 
V34 V2+2 
Since the length of the interval [—1, 2] is 3, it follows from 
Inequality Rule (b) that 


L siege [=LA] 


2 
—<| ae 
V 34 -1V2+2° 
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Exercise F46 
Use the Inequality Rules to prove the following inequalities. 


3 1/2 
a xsin(1/x1?) dx < 4 b) os e? dx < le!/4 
1 ? 0 i 


We saw in Subsection 1.3 that if the function f is integrable on fa, b], then 
so is |f|. We now use the Inequality Rules to obtain an inequality 
involving the integrals of f and |f|, known as the Triangle Inequality for 
integrals. The name arises because of the similarity between this inequality 
and the Triangle Inequality for numbers: 


n n 
2 a| < $ lail. 
i=1 i=1 


Theorem F58 Triangle Inequality for integrals 
Let f be integrable on [a,b]. Then 


iG < fit. 


Furthermore, if |f (x)| < M for x € [a,b], then 
b 
ie 
Proof We know that, for all z € [a,b], 


-|f| < fæ) < |f). 
Since |f| is integrable on [a,b], it follows from Inequality Rule (a) that 


- fis firs fa 


which is equivalent to 


fi < fish 


Finally, if |f (x)| < M for x € [a,b], then, by the above inequality and 
Inequality Rule (b), 


E 


<M(b-a). 


b 
< | Ifl < M- a). 


3 


Sometimes we use the Inequality Rules and the Triangle Inequality in 
combination, as in the next worked exercise. 


Worked Exercise F37 


Prove that 


Solution 


By the Triangle Inequality for integrals, 


T/2 Er T2 = 
i x—m/2 T < [ x— 7/2 
0 2+cosz 0 22 COS GE 
T/2 J 
=| T/2-2 ix 
0 24,cosx 


@. For 0 < x < 1/2, we have 7/2 —x>0 and cosx>0. # 


Next, since 
2+cosx>2, for x € [0,7/2], 


we have 
1 


1 
> a a Ok 
Sen or eee 


Thus, by Inequality Rule (a) and statement (x), 


ee Na 
| Diao i; 3(5- 2) a 


IA 


II 
Ya NIe NIe 


= 
o>) 


Exercise F47 


Prove the following inequalities. 


4 sin(1/2) 
fi) | =t (b) 


v tanx 
s dx 
o 3-—sin(x?) 


Inequalities, sequences and series 


171 


Unit F3 Integration 


John Wallis 


172 


3.2 Wallis’ Formula 


In Worked Exercise F35 we used a reduction formula to show that if 


a /2 
as, sid” tdr, 9 SN 1 2s. 
0 


then 
—1 

hsz lii=l and [,= (7 ) Inn for n > 2. (18) 

We also remarked that by repeatedly applying equations (18) we obtain 
1 35 2n—1 7 

Jon S Be ee Se, OF >] 19 

an 2°46 i D oe ae 
and 

2 4 6 2n 
Í ESUE E Ff > 1. 20 
a= GG al e (20) 


Note that the formula for Jz, involves 7, but the formula for [9,41 does 
not. 


We now use these results, together with inequalities between various 
integrals of the form In, to establish two remarkable formulas for 7, the 
first of which is Wallis’ Formula. 


Theorem F59 


; 2 2 A ad G 2n 2n T 

(a) lim at a e ee OS aa a a E E = 

nm>o\l 3 3 5 5 7 2n—1 2n+1 2 
(Ge 2e 


Oa a 


John Wallis (1616-1703) was the most influential English 
mathematician before the rise of Isaac Newton. He was a 
cryptographer to the Parliamentarians during the Civil War and in 
1649 was appointed Savilian Professor of Geometry at Oxford. 


His most important work is his Arithmetica Infinitorum (Arithmetic 
of Infinitesimals), published in 1656. It was in this work that Wallis 
derived the formula which bears his name, and it was through 
studying this work that Newton came to discover his version of the 
binomial theorem. 


Wallis’ treatise on conic sections, published in 1655, contains the first 
publication of the symbol for infinity, oo. 


Part (c) of the next exercise gives an identity needed in the proof of 
Theorem F59. If you are short of time, then you may wish to skip this part 
of the exercise, which is quite challenging, and skim read the solution and 
also the proof of the theorem. 


3 Inequalities, sequences and series 


Exercise F48 


Forn = 1,2,..., let 
2244 6 6 2n 2n (nl)? 22” 
me eee ee + eee and bp = A, 
133 5 5 7 In—-1 Intl (Qn)l/n 
(a) Evaluate an and bn, for n = 1, 2,3. 
(b) Verify that 


2 1 
p= — ) an, for n = 1, 2,3. 


an 


(c) Prove that 


2 1 
2 = ( us ) an form = 152,522. 


n n 


Proof of Theorem F59 
Let 


T/2 
p= sin” gdz, n=0,1,2,..., 
0 


and let the sequences (an) and (bn) be as given in Exercise F48. 


(a) Using equations (19) and (20), we obtain, for n > 1, 


Tmo 1-3+3-5- +++ -(Qn-1W(Qn+1) er 1 2 
Iny 2 2+ 4+ 4+ +++ + (2) (2) De a 
so 
a Iny \ T 
” Ion, J 2 
Thus to prove part (a), it is sufficient to show that 
I 
ant? +1 asn— oo. (21) 
Ion 


We do this as follows. Since 0 < sinx < 1, for x € [0,7/2], we have 


2 2n+1 


sin” x > sin z>sin?”? r, for x € [0,7/2]. 


It follows by Inequality Rule (a) that 
Ton > I2n41 2 Lanse. 
Thus, on dividing by Jon, we obtain 


1> Tanti şs Ton+2 _ 2n+1 _ ey 
= lo, ~ Loin 2n+2 2+2/n 


by equation (18). On taking the limit as n — oo, we deduce that 
statement (21) holds, by the Squeeze Rule for sequences from Unit D2. 
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(b) We know, from Exercise F48(c), that 


Qn +1 1 
2 = ( eas ) ay = (242) an 
n n 


By part (a), 


T 
an > z asn > 00, 
so, by the Product Rule for sequences, 
T 
bn 2X ET as n —> OO. 


Hence, by the continuity of the square root function, 


bn > VT as n > oo. | 


3.3 The Integral Test 


In this subsection we introduce a method based on integration for 
determining the convergence or divergence of certain series of the 
0O 
form > f(n), where the function f is positive and decreasing and 
n= 


tends to 0. The method is based on the fact that it is often easier to 
evaluate an integral than a sum which has a similar behaviour. 


Theorem F60 Integral Test 


Let the function f be positive and decreasing on [1,0o), and suppose 
that f(z) > 0 as x > œ. Then 


f(n) converges if the sequence ( | s| is bounded above 
1 


f(n) diverges it [ f > œ as n > oo. 
1 


Remarks 
1. The Integral Test is also called the Maclaurin Integral Test. 


2. In both parts (a) and (b), the number 1 can be replaced by any positive 
integer. 


3 Inequalities, sequences and series 


Colin Maclaurin (1698-1746) was a Scottish mathematician who spent 

most of his career as professor of mathematics at Edinburgh 

University, having been appointed on the recommendation of Isaac 

Newton. A popular teacher, he was described as a ‘favourite professor’ 

and the ‘life and soul’ of the university. He is notable for having | 
extended Newton’s work on the calculus and geometry. His 
two-volume Treatise of Fluzions (1742), which was the first systematic 
treatment of Newton’s methods, contains a detailed discussion of 
infinite series and includes the Integral Test in verbal form. 


The Integral Test was rediscovered by Cauchy — he published it in 
1827 — and consequently it is also known as the Maclaurin—Cauchy 
Integral Test. 


Colin Maclaurin 
Proof of Theorem F60 For n=2,3,..., let 
Sn = f(1) + f(2) +---+ f(n) be the nth partial sum of the series 5 f(n) 


n=1 
and let P,-1 be the standard partition of [1, n] with n — 1 subintervals of 
length 1, that is, 


{[1,2],...,/é,¢+1],...,[n-—1,n]}. 

Since f is decreasing on [1,00), we have, for i = 1,2,...,n—1, 
mi = fli+1) and Mi= f(i). 

This is illustrated in Figure 16. 


Ya YA 


area = M; 62; 


= f(t) 


&Y 
p 
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>. 
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z3 

XY 


eiil en 
Figure 16 The lower and upper Riemann sums of f 


Also, each subinterval in the partition has length 1. Hence the lower and 
upper Riemann sums for f on [1,n] are 


L(f,P, et) -Emx 2ps -+ f(n) = 8, — fC) 
and 


U( i, Pat) -Fma 1) +--+ f(n- 1) = sn — f(n). 
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Since f is bounded and monotonic on [1, n], it follows from Theorem F48 


n 
in Subsection 1.2 that the integral [, = f f exists and satisfies 
1 


L(f,Pn—-1) < In < U(f, Pat), 
so 

5n — f(1) < In < sn — f(n). (22) 
We now consider two cases separately. 
Case (a): The sequence of integrals (In) is bounded above. 


In this case, there exists M € R such that 
Ia < M, for n= 2,3,.... 

It then follows from the left-hand inequality in statement (22) that 
sa IFOAM; fot n=2; 3 


Thus the increasing sequence (sn) is bounded above, so it is convergent, by 
the Monotone Convergence Theorem (Theorem D22 in Subsection 5.1 of 
Unit D2). 


x oo . 
Hence the series X f(n) is convergent. 
n= 


Case (b): The sequence (In) is not bounded above. 


The sequence (In) is increasing, since 


n+1 
hn-h=] $20, 
n 
so in this case 
In 2 œ asn —> oo. 
It follows from the right-hand inequality in statement (22) that 
Sy. Ins for N= 25352644 


Thus, by the Squeeze Rule for sequences which tend to infinity 
(Theorem D18 in Subsection 4.3 of Unit D2), 


Sn > œ as n — OO. 


Hence the series ane i f(n) is divergent. E 
n= 


In Theorem D33 of Unit D3 Series you saw that the basic series 
y (1 /n? converges for p > 2 and diverges for 0 < p < 1. We can now 
n= 


use the Integral Test to deduce the behaviour of this series for all p > 0. 
We first consider the case when p = 1. 


3 Inequalities, sequences and series 


Worked Exercise F38 


dz 
Use the fact that J= = log x to prove that D is divergent. 
z 


n= r 


Solution 
Let 


f(a)=—= (e€ [1,00)), 
Then f is positive and decreasing on [1,00), and 
f(x) 30 asx ow. 


Also, for n > 1, 


fi he 


[ log J" 


= logn > œ as n —> oo. 


Hence, by part (b) of the Integral Test, the series diverges. 


We now consider the series y~ i /n? when p > 0 but p #1. 
n= 


Worked Exercise F39 


Use the Integral Test to determine the behaviour of the series 


oe) 


1 
ie. forp>0, pA. 


n=l 


Solution 
Let p > 0 and p Æ 1, and let 


K = Wak (eea 
Then f is positive and decreasing on |1, 00), and 
f(x) 30 as z> o. 
Also, for n € N, 
n n i= 
PATa 


First suppose that p > 1. Then p — 1 > 0, so equation (*) gives 


E OEL PE 
1“ p= w= ~ p-1 


| -r 
l-p 
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Exercise F49 


Show that 


f e S 
r(logr)? log’ 
and hence prove that 


co 


1 
`> E is convergent. 


n=2 


4 Stirling’s Formula 


This section concerns the value of the quantity n! which arises in many 
problems in probability. You will see that integration techniques give an 
excellent estimate for n!, called Stirling’s Formula, which can be expressed 
as 

n! 


lim —————— = 1 
noo y2rn (n/e)” 


4.1 Comparing functions of n 


In this subsection we define a relation ~ on the set of all positive functions 
with domain N to provide a way of comparing the behaviour of such 
functions for large values of n. This relation is, in fact, an equivalence 
relation, although we do not prove this here or make explicit use of these 
properties. (You met equivalence relations in Unit A3 Mathematical 
language and proof and you may like to try to prove that ~ is an 
equivalence relation yourself; it is included as an exercise in the additional 
exercise booklet for this unit.) 


Definition 
For positive functions f and g with domain N, we write 
f(n) ~ g(n) as n —> co 


to mean 


f(n) 


———>]l anon. 


g(n) 


For example, 


n2 tnan? as n — oo, 


since n? +n > 0 and n? > 0, for n = 1,2,..., and 
n? +n 
ne 
Note that the statement 


=> l asn — o. 


f(n)~ g(n) as n —> c 


does not imply that f(n) — g(n) tends to zero or is even bounded. For 
instance, in the above example, (n? + n) — n? = n tends to infinity. Note 
that often we omit ‘as n — oo’ when writing expressions of this type. 


We have the following Combination Rules for ~. These rules follow from 
the Combination Rules for sequences; we omit the proofs. 


Theorem F61 Combination Rules for ~ 
If fi(n) ~ gi(n) and fa(n) ~ g2(n), then: 

Sum Rule film) + faln) ~ gi(n) + g2(n) 
Multiple Rule Afi(n) ~ Agi(n), for A € Rt 
Product Rule fi (n) fo(n) ~ gi(n)ga(n) 


fi(n) _ aln) 


Quotient Rule AT 
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Note that if l > 0, then the statements 
f(n) 391 as n —> oo 

and 
f(n)~l asn —> oo 


are equivalent, since each is equivalent to the statement 
f(n) 


= l as n — oo. 


4.2 Calculating factorials 


For small values of n we can evaluate n! directly by multiplication or by 
using a scientific calculator. 


Exercise F50 


Complete the following table of values of n!. 


n| on! n n! n n! 

1 6 720 || 20 | 2.432... x 1018 
2 7 5040 || 30 

3 8 40320 || 40 

4| 24| 9| 362880 || 50 

5 


120 || 10 | 3628800 || 60 


As n increases, n! grows very quickly, and is soon beyond the range of a 
calculator. Many calculations in probability theory involve n! for large 
values of n, so it is important to be able to estimate this quantity as 
accurately as possible. The following result gives us a way of doing this. 


Theorem F62 Stirling’s Formula 
n! ~ V2an(n/e)” asn— oo. 


We give the proof of this result in Subsection 4.3. 


James Stirling (1692-1770) was a Scottish mathematician who made 
many contributions to analysis. He was friends with Isaac Newton 
and Colin Maclaurin, and at Maclaurin’s request he provided help and 
criticisms when Maclaurin’s Treatise of Fluxions was in proof. 


Stirling’s most important work was his Methodus Differentialis, 
published in 1730. It focuses largely on infinite series and was directly 
stimulated by earlier work of Wallis and Newton. It includes a series 
for log n!, now usually written in the form known as Stirling’s 
Formula, given above. Earlier in 1730, Abraham de Moivre 
(1667-1754), famous for his work on the laws of chance, had published 
a similar formula for logn!, but he had not been able to find a precise 
value for the constant which Stirling showed to be V2r. 


Exercise F51 


Use a calculator to evaluate V27n(n/e)” for the following values of n. 
(a) n=5 (b) n=10 (c) n= 50 


As you saw in Exercise F51, even for small values of n, Stirling’s Formula 
gives reasonable approximations to n!, and the relative error (that is, the 
error expressed as a percentage of the true value) decreases as n increases. 


n n! Stirling’s approximation Relative error 

10 3628800 3 598 696 0.83% 

20 2.433 x 1018 2.423 x 1018 0.42% 

50 3.041 x 10% 3.036 x 1064 0.16% 
100 9.333 x 10457 9.325 x 10157 0.08% 


In fact it can be shown by a more careful argument that 


el/(12n+1) < i < el/(2n) for n > 1. 


V27n (n/e)” 
For example, if n = 10, then 


el/(12n+1) — 91/121 — 199829... and e!/(12") = ¢1/120 _ 1.00836... 


which indicates a relative error of about 0.8%, as shown in the above table. 
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Stirling’s Formula can be used to give estimates of probabilities. For 
example, if we toss n fair coins, then it can be shown that the probability 


1 ! 
of obtaining exactly r heads is n —, where my). __™ is the 
ry 2" r ri(n—r)! 


binomial coefficient which you met in Subsection 3.4 of Unit D1 (we do not 
prove this here). Therefore, if we toss 200 coins, then the probability of 
obtaining exactly 100 heads and 100 tails is 


Ge 1 200! 
100) 2200 ~ (700!)2 2200 

4007 (200/e)?°° 
(2007 (100/e)190)* 9200 
4007 (20020 /e200) 


~ 3007 (100200 /¢200) 2200 


— V4007 


2007 
1 
10./7 

1 


~ 17.724...’ 
perhaps rather higher than you might expect. 


~~ 


The following exercises give you a chance to practise using Stirling’s 
Formula. In some of these exercises you will need to use the Combination 
Rules for ~ from Subsection 4.1. 


Exercise F52 


Use Stirling’s Formula to estimate each of the following numbers (giving 
your answers to two significant figures). 


300\ 1 300! 1 
a WE (b) (1001)3 3300 


Exercise F53 


Use Stirling’s Formula to determine a number A such that 


& J r) ~ 22” as n— o. 
2n n 


Exercise F54 


Use Stirling’s Formula to prove that 


Hint: You can assume that if f(n) ~ g(n), then (f(n))!/" ~ (g(n))!/”. 
(This result about ~ holds because if n € N, then 


1/n 


a<a/" <1, for0O<a<1l, 


and 
1<a” <a, fora>l, 


by the rules for inequalities.) 


4.3 Proof of Stirling’s Formula (optional) 


The proof of Stirling’s Formula is quite long. If you don’t have time to 
study it properly, then you may find it interesting to skim through this 
subsection and see how integration is used to obtain an estimate of this 
type. 

The idea of the proof is to consider the graph of y = log x and a sequence 
of small sets which lie below y = log x and above the graph of a polygonal 
approximation to y = log x. We show that the areas of these small sets 
form a convergent series and deduce from this that the sequence 


n? +0/2) 


ün = — 
O ein!’ 


N= 253500565 
is convergent. Finally we use Theorem F59(b) to find the limit of the 
sequence (an), and this gives Stirling’s Formula. 
Proof of Theorem F62 We need to show that 
n! ~ V2nn(n/e)” as n —> oo. 
We begin the proof by considering the function 
f(x) = log z. 


For n = 2,3,..., let Pp_1 be the standard partition of the interval [1, n] 
with n — 1 subintervals: 


{[1,2],... [ii +1],..., [n — 1, n]}. 
Now consider the sequence (en) ae where cp is the total area of the 
segments which lie between the graph 

y=logz, 2x €([1,nl, 
and the polygonal graph with vertices 


(1,0), (2,log2), (8,log3), 2.5 (n,logn), 
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as illustrated in Figure 17. This set consists of n — 1 thin segments. 


®. The function f is concave (that is, its derivative is decreasing), so the 
line segment joining 


(i, logi) to (+ 1, log(t + 1)) 
lies below the graph y = log x. ® 


total area of contribution 
segments = Cy, tO Cn 


(i + 1, log(é + 1)) 


l (i, log i) 


12 Seait1 om = 
Figure 17 An approximation for the area under the graph of log x 


The area of the set between y = log x, x € [1,n], and the z-axis is 


/ log z dx = [x log x —x]) 
1 


= nlogn — (n — 1). (23) 
The area between the polygonal graph and the z-axis is 
s(L(f, Paai) +UI Pat) (24) 


®. This is because, on each subinterval [i,i + 1], the function f attains its 
minimum when x = į and its maximum when x = į + 1, so the area of the 
trapezium whose base is the subinterval is equal to the average of these 
two numbers. ® 


Since f is increasing, we have 


L(f, Pr-1) = log1+log24+.---+log(n — 1) 


= log(n — 1)! 
= log(n!/n) 
= logn! — logn (25) 
and 
U(f, Pa-1) = log 2 + log3+---+logn 
= logn!. (26) 


Substituting from equations (25) and (26) into equation (24), we find that 
the area between the polygonal graph and the z-axis is 


5 (log n! — logn + logn!) = logn! — 5 log n. (27) 


4 Stirling’s Formula 


It follows from equations (23) and (27) that 


Cn = nlogn — (n — 1) — logn! + 5 logn 


nrt(1/2) 
= log EEr 
e n: 


PORN . l (1, log 2) (2, log 2) 

The sequence (cn) is positive and increasing. Also, 

Cn <log2, forneEN, 
since the n — 1 segments which contribute to the area cn can be translated 
so that they all lie (without overlapping) in the triangle with vertices 
(1,0), (1, log 2) and (2, log 2), as illustrated in Figure 18. This geometric (10) (2,0) 
property holds because the function log is concave. It follows from the i , 
Monotone Convergence Theorem (Theorem D22 in Subsection 5.1 of Figure 18 The segments of 
Unit D2) that the sequence (cn) is convergent. area Cy 


Next, since the exponential function is continuous, the sequence 
ün = Oy = Lyre 


is convergent also. Thus 


nrt(1/2) 


in = => L asn >œ, (28) 


enin! 
for some non-zero number L. 


To find L, we consider the quotient 


a2 n2n+1 / (2n)2r+(1/2) (2n)! ni/2 


yw a 


ee x 
am €2"—2(nl)2 


e 
enm) (m22 ^ V/9" 
We now let n > œ in this equation. We have 

(2n)! nt? 1 
An > L, am > L and (mm a” 
by Theorem F59(b). 


®. Recall that Theorem F59(b) says that 
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Thus we can rewrite statement (28) in the form 
nrt+(/2) 
a er a as n > ©. 
Hence, by the Combination Rules for sequences, 
e”n! 
ann 


which can be rearranged to give Stirling’s Formula: 


n! ~ V2rn(n/e)” as n —> ov. E 


=> V27 ano o, 


Summary 


In this unit you have studied the formal definition of what it means for a 
function f to be integrable on a closed interval [a,b]. You did this by 
studying lower and upper Riemann sums for f, which give lower and upper 
estimates for the area between the graph of f and the x-axis for a < x < b, 
by approximating the area with a sum of areas of rectangles. These 
rectangles are based on a collection of subintervals of [a,b] known as a 
partition of [a,b], and you saw that, by decreasing the length of these 
subintervals (that is, the mesh of the partition), we get increasingly 
accurate estimates for the area. If the supremum of the lower Riemann 
sums is equal to the infimum of the upper Riemann sums, then we say that 
the function f is integrable on [a,b] and the common limit is the integral 
of f. You also saw that to prove that f is integrable it is sufficient to 
consider the sequence of standard partitions P, where the interval is 
divided into n subintervals of equal lengths. 


You then met the Fundamental Theorem of Calculus and saw that 
integration can be thought of as the process of finding a primitive, and 
hence as the opposite of differentiation. You studied a number of 
techniques for finding primitives, including integration by substitution, 
integration by parts and the use of a reduction formula. 


You also met some useful inequalities that enable us to estimate various 
integrals when it is not possible to find an exact value. You saw how these 
can be used to prove Wallis’ Formula, an approximation for 7, and studied 
the Integral Test which gives a method based on integration for 
determining the convergence or divergence of certain series. Finally you 
met Stirling’s Formula which gives an approximation for n! and is proved 
using upper and lower Riemann sums. 


Learning outcomes 


Learning outcomes 


After working through this unit, you should be able to: 


e determine the lower Riemann sum L(f,P) and the upper Riemann sum 
U(f, P) for a given function f and partition P 


b 
e understand the definition of the integral f f 
a 
e use upper and lower Riemann sums to determine whether a given 
function is integrable 
e use basic rules for manipulating integrals 
e state various sufficient conditions for a function to be integrable 


e explain what is meant by a primitive of a function and understand the 
Fundamental Theorem of Calculus 


e use the Fundamental Theorem of Calculus and the table of standard 
primitives to evaluate certain integrals 


e use integration by substitution and integration by parts 

e use the reduction of order method to evaluate certain integrals 
e determine lower and upper estimates for given integrals 

e state Wallis’ Formula 


e use the Integral Test to determine the convergence or divergence of 
certain series 


e understand the connection between integration and Stirling’s Formula 
for n! 


e use Stirling’s Formula to determine the behaviour of certain sequences 
involving factorials. 
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Table of standard primitives 


cae (ce +1) 
rett Hey + 1) 


Primitive F(x) 


a” /loga 


— COS T 


sin £ 


log (sec x) 


e? 


log x 

log |z| 
xlogz-— zr 
cosh x 


sinh x 


log (cosh x) 


(a? _ ay, a#0 


(a? — a?)-¥/2, a £0 


(a? +2?) "2, @0 


(a? eet a#0 
(£2 — a?) "2, a#0 
(a2 +x)", a0 


e°” cosbz, a,b #0 


e°” sin bx, a,b #0 


log(a + (x? — a?) t?) 
cosh” !(a/a) 


log(a + (a? + «?)1/2) 


tz(a? — pe 5a” sin! (x/a) 

5u(u? =a) ža? log(x + (x? — a?) !/?) 
5x(a" aes 5a" log(a + (a? + #?)1/?) 
a? + b? 


ax 


(acos bx + bsin bx) 


(asin br — bcos bx) 


a? + b2 


Solutions to exercises 


Solution to Exercise F33 
(a) The graph of f is shown below. 


= 0 T 


First, min f = 0, since 

1. f(x) > 0, for all x € [-1, 1], 

2. f(0)=0. 

Next, inf f = 0, since f has minimum 0 on [—-1, 1]. 

Also, sup f = 1, since 

1. f(x) < 1, for all x € [-1, 1], 

2. if M’ < 1, then M’ is not an upper bound for f 
on [—1,1] because the sequence (1 — 1/n) is 
contained in [—1, 1] and 

f(1—1/n) = (1-1/n)? 51 asn > o, 
so there exists x’ € [—1,1] such that f(a’) > M’. 
(Alternatively, you could consider f(—1 + 1/n).) 


Finally, max f does not exist, since there is no 
point x such that f(x) =1. 


(b) The graph of f is shown below. 


First, inf f = —1, since 


1. f(x) > —1, for all z € [—1, 1], 


Solutions to exercises 


2. if m’ > —1, then m’ is not a lower bound for f 
on [—1, 1] because the sequence (1/n) is 
contained in [—1, 1] and 

f(1/n) = (1/n)? —1 > -1 asn >œ, 
so there exists x’ € [—1,1] such that f(a’) < m’. 

Next, min f does not exist, since there is no point 

x in [-1,1] such that f(x) = —1. 

Also, max f = 1, since 

1. f(x) <1, for all x € [—1, 1], 

2. f(—1) =1. 

Finally, sup f = 1, since f has maximum 1 

on [—1, 1]. 


Solution to Exercise F34 
The interval [—1, 2] has length 3; hence the 
4 subintervals in this standard partition P of 


[—1, 2] must each have length #. Thus the required 
standard partition of [—1, 2] is 


P ={[-1, -3b l-i b [a4], [32] 
The mesh of P is the common length of the 
subintervals, namely 3, 


Solution to Exercise F35 


The function and the partition are illustrated 
below. 


eZ 
QOQONONTDS MN) 
SONS 


LLKR 


KAAVAA VYANA Xʻ“A XSA X 


5 2 


For the three subintervals in P, we have 


mı = 0, Mı = f(0) = 1, bx, =}, 
me = f(3)=2, Mo=f(4)=1, 6x =H, 
ms = f (3) =1, M3 = 2, 6x3 = 5. 


(Note that mg is also equal to f(1).) 
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Then 


3 1 
=la Ta 
_ əl 
~ 50 


Solution to Exercise F36 
Let 


f(z) = zx, TE [0,1], 
and let P, be the standard partition of [0, 1], 


a- (pa pa} 


Since f is increasing we have, for i = 1,2,...,n, 


m=i) -= 
“=(= 


and 


OX; = oe 
n 
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and also 


U(f, Pn) = $ Midas 


= 1. n(n+1) 
on 2 


(Here we have used the fact that the sum of the 


arithmetic series 1 +2+---+ 7 is 


each n € N.) 


Then, as n — oo, we have 


lim L(f, Pa) ae and = lim U(f, Pn) 
n— o0 N—+00 


2 


In particular, 


BEREE 


and hence 


f+- e= 


1 
1 
Thus f is integrable on [0,1] and f f= 7 
0 


, for 


z 


Solution to Exercise F37 
(a) The graph of f is shown below. 


YA 

34 e 

0 i 2 
—2 


Let P,, be the standard partition of [0,1] into n 
equal subintervals: 


(oaae 


For i = 1,2,...,n, we have 
m; =inf{ f(x): (i—1)/n < x <i/n} = —2. 
For i = 1,2,...,n — 1, we have 


M; = sup{ f(x) : (i — 1)/n < x <i/n} = —2. 


Also, Mn = 3 and 


1 
6a, =; fori=1,2,...,n. 
n 


Hence 


Lt) = Som OX; 
i=l 


EC 


i=1 


n-1 


i=l 


5 
=-2+-—>4-2 ano. 
n 


Solutions to exercises 


Since ||P,,|| — 0 as n > oo, and 
lim L(f, Py) = lim U(f, Pa) = —2, 
Noo Noo 


it follows from Theorem F45 that f is integrable 
on [0,1] and 


ee 


(b) The graph of f is shown below. 


YA 
H Kd 
0 1 2x 


Let P, be the standard partition of [0,1] into n 
equal subintervals: 


Tesni 


Each subinterval 


li 
$ +, for t= 1,2,...,n, 
n n 


contains both rational and irrational points: 


at the rational points, f(x) = zx, 
at the irrational points, f(x) = 0. 


Hence, for i = 1,2,...,n, 
m,=0O and M=— 
n 
Also, 


ih 
6a, =—-, fori=1,2,...,n. 
n 
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Hence Solution to Exercise F39 
(f, Pn) = y ee (a) From the Fundamental Theorem of Calculus 
= E and the table of standard primitives, we deduce 
n 1 that 
= y (0 x 9 = 0, 4 
i=1 ý f (x? + 9)? dx 
y 0 
4 
U(f, Pa) = >| Mion; = [ea Da a a 
i=1 


= 10 + 2log9 — 3 log 3 


= i 1 9 
=A mT = 10 + 5 log 3. 


i=1 
1 (b) From the Fundamental Theorem of Calculus 
= z2 a and the table of standard primitives, we deduce 
i=1 that 
1 n(n+1) e 
=a 2 f log x dz = [xloga — x]{ 
_i!,1 41 ' =(e-e)-(0-1)=1 
= 3 a 6 as n — Oo. : 
Since ||P,|| > 0 as n > oo, but Solution to Exercise F40 
lim LP) = 07 5 = lim U(f, Pa), Using the table of standard primitives and the 
ee aces Combination Rules, we obtain the following 
it follows from Theorem F46 that f is not primitives. 


integrable on (0, 1]. (a) F(x) = 4(x log x — x) — 2(5 a) 


Solution to Exercise F38 = 4(xlog x — x) — tan“! (x/2) 
e2t 
(a) We have (b) F(x) = 2(} log(sec 3x)) + Pep 5 (2cos x + sin x) 
1 
F'(x) = ote aie (1 + 3(x? — cr 2) = $ log(sec3x) + že?” (2 cos x + sin x) 
(a? — 4)-1/2 (2? o 4)1/2 4 z) (These results can be checked by differentiation.) 
F 2_ 4yi/2 7 $ 
x + (a? — 4) Solution to Exercise F41 
= (x? — 4)? f 
Fa) (a) Take u = sin 3x; then 
= f(z 
i du 
as required. T 3cos3z, so du = 3cos3z dz. 
(b) We have Hence 
F'(£) = ES cosh x sintim 32) cossadr = b fsmudu 
sinh* x 
= coshx =—icosu 
cosh? z = —4 cos(sin 3x). 
~ a (b) Taking u = 2 + 3x3, we obtain 
= i 5 
as required. Jre + 32°)" dz = 5.2 + 32°)°. 
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(c) Taking u = 2x7, we obtain 
J esine dx = —} cos(2x°). 
(d) Using equation (17), we obtain 


fare + 827) dx = # log(2 + 3x7). 


Solution to Exercise F42 
Let u = 1 + e”; then 
du _ 
dz 
Also, 


e”, so du=e’ dz. 


whenx=0, u=2, 
whenx=1, u=1+e. 


Hence 


i e7 a du 
——— dr = 2 
» tee w 
7 “j 1+e 
=al 
1 1 e—1 
l+e 2 
Solution to Exercise F43 
(a) Let u = (x — 1)!/?, so x = u? + 1; then 


ai = 2u, so dx = 2udu. 
du 
Hence 
f da 
3(x — 1)3/2 + a(x — 1)1/2 
B f 2u d 
© J 3u + (u? + Iu “ 
= f PE 
4u? +1 
= 2 | _ du 
(2u)? +1 
= tan“! (2u) 
= tan™t(2(x — 1)!/?). 


(b) Let u = V1 +F e7, so x = log(u? — 1); then 


dr 2u Gu 2u 
a. P ee 


du. 


Solutions to exercises 


Also, 


when zx = 0, u= v2, 
when zx = log3, u=2. 


Hence 
log 3 2 ou 
f evTFEds= | (u? — 1)u 5 du 
0 y2 uł — 1 
2 
-| 2u? du 
V2 
2 
= [28 
7 [ĝu lz 
= (16 — 4V2)/3. 


Solution to Exercise F44 

(a) Here we use integration by parts, with 
fe)=lgr and g/(x) = 2"; 

then 
f'(z)=1/az and g(z)= $e A/3 


Hence 


Ja" log z dz = 3 74/3 log r — gf ao dx 


= 34/3 log r — i [ar 
= 374/3 log x — 2gs, 


(b) We use integration by parts twice. On each 
occasion we differentiate the power function and 
integrate the trigonometric function. 


We have 
T/2 
f x? cos z d£ = Es sin x] an 
0 
T/2 
-f 2x sin x dx 
0 
2 T/2 
a 2 | xsinx dx 
4 0 
and 


T/2 1/2 
f xsin x dz = |x(— cos x)] 
0 


a /2 
-f (— cos x) dx 
0 
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It follows that 


T/2 2 
f t cosrdi = 2 
0 4 


Solution to Exercise F45 


1 
(a) =f e” dz = |e"), =e-1. 


0 
(b) Using integration by parts, we obtain 


=e—nln-1, forn>1. 


(c) Using the solution to part (b) with 
n = 1,2,3,4 in turn, we obtain 


I =e-Ih=e-(e-1)=1, 
Ig =e —-—2h, =e-2, 
Iz =e — 3h =e — 3(e — 2) = 6 — 2e, 
Ig = e — 4I; = e — 4(6 — 2e) = 9e — 24. 
Solution to Exercise F46 
(a) Since sin(1/x!?) < 1, we have 
asin(1/x1°) < x, for æ € [1,3]. 


Thus it follows from Inequality Rule (a) that 


3 3 
f xsin(1/x!?) dz < f rdr 
1 1 


(b) If x € [0,5], then 


t= < e% < e1/2)? 2 en, 


5 Boa a . 
because the function x — e” is increasing on 


[0, 5]. 


Thus it follows from Inequality Rule (b) that 


dle 


1m 
< | e” dz < letia 
0 
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Solution to Exercise F47 
(a) Since 

|sin(1/az)| < 1, for x € [1,4], 
and 

2+ cos(1/z) > 1, for x € [1,4], 


it follows that 


in(1 
ee <1, forze [1,4]. 
2 + cos(1/z) 
Hence, by the Triangle Inequality for integrals, 
i 1 
f _sin(1/z) © jsf meds 
1 2+cos(1/x) 
(b) Since 


tang > 0, for x € [0,7/4], 
and 

3— sin(x?) > 2, for « € [0,7/4], 
it follows that 


tan g 
Dema eres rE 


Hence, by the Triangle Inequality for integrals and 
Inequality Rule (a), 


f tang d T tang 
— H — T 
9  3-—sin(z?) ia 9 |3—sin(2?) 
T/4 
<f 5 tan xdg 
0 
T/4 


[5 log(sec x)] 5 
= $(log(sec 7/4) — log 1) 
= 1 log(V2) 


a 7 log 2. 


Solutions to exercises 


Solution to Exercise F48 It follows that 
2n)!)? (Qn +1 
(a) We have i= gemini? {Ont One) 
22 4 22n (n!) 
ee a a 
2244 64 ((2n)!)?(2n + 1) 
ae eo - (<2) 
2 24466 256 "\2n+1 
a3 = =. — = —— 
3° 13°35 5 T 175° Hence 
b = net 5 > (2n+1 
avr Ng, pe 
294 
bə = (2!)*2 = 4 2, as required. 
Al,/2 3 
7 (31)226 16 z Solution to Exercise F49 
35 = zp V9: 
61/3 15 Let u = log x; then 
(b) Using the results of part (a), du 1 
—=-, so du= — 
b? =4 = 3a1, dx x 
2 32 5 Hence 
b3 = pl 
9 2 f dr — fdu 1 1 
p2 256 _ i. x(log x)? u? u log z` 
3 e , 
T5 3 Now let 
as required. 1 
(c) We have f(x) = x(log x)? (x € [2, 00)). 
2 (lynn Then f is positive and decreasing on [2, 00), and 
no N20." 
VAn n f(x)— 0 as z> o. 
We now try to express 
Also, for n > 2, 
2-2-4-4.-.-- + (2n) (2n) 
Un ee g e HINT n u dx 
[:3.:3-5- = -(n—1)0Qn +1) eS) _ a 
2 2 x(logx)? 
in terms of factorials. 17” 
The numerator is a product of 2n even numbers. z | log -| 5 
Taking a factor 2 from each term, we deduce that 1 1 1 
= = < , 
2:2-4-4..--. - (2n) (2n) log2 logn ~ log2 


Since the sequence of integrals ( | f) is 
2 2 
bounded above, it follows from part (a) of the 


The denominator of a, cannot be treated in quite 
the same way, as all its factors are odd. To relate it 


to factorials, we introduce the missing even factors: Integral Test that the series a 1 > 
e E e e E A. 4 n(log n) 
_1-2-2-3-3-4-4- ++. - (Qn — 1)(2n)(2n)(2n +1) 
7 2-2-4-4.... «(2n)(2n) 
_ ((2n)!)? (2n + 1) 
228 (al)? 
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Solution to Exercise F50 


The values are as follows. 


n n! 


30 | 2.652... x 1082 
40 | 8.159... x 104" 
50 | 3.041... x 1064 
60 | 8.320... x 108! 


Solution to Exercise F51 


The values are as follows. 
v 2rn (n/e)” 


5 | 118.019... 
10 | 3.598... x 10° 
50 | 3.036... x 1064 


Solution to Exercise F52 


In each part we approximate the factorials using 


Stirling’s Formula. 
(a) (300\ 1 _ 300! 
150/ 2300  150!150! 2300 
6007 (300/e)300 


(3007 (150/e)#50) ? 2300 


v 6007 
3007 


~ 30/7 
= 0.046 (to 2 s.f.). 


6007 (300/e)300 


(100!)3 3300 ~ (2007)3/2 (100/e)300 3300 


= 0.0028 (to 2 s.f.). 
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Solution to Exercise F53 
We have 


Gai = -E 


An / 2n\ _ (4n)!(n!)? 

an} / \n) ~ (Qn 
By Stirling’s Formula, and the Product and 
Quotient Rules for ~, we obtain 


(an) / (7) „ Ban (an fey (Vann (r/e) 
2n (dan (2n/e)2n)° 
B V8rn 44" Irn 
E (4rn)3/2 26n 
9/8 44" 
~ g gon 
L gn, 


V2 


sO 


n 


Hence \ = 1/72. 


Solution to Exercise F54 


Using Stirling’s Formula, we obtain 


n n 


n e 


nm Vrn (nje) yrn 
Thus, by the hint, 


n\ 1/n e” 1/n g 
Gi) i ( z 7 (Van) /” nin 
We know (from Worked Exercise D26 and 


Exercise D34 in Subsection 3.3 of Unit D2) that, 
for any positive number a, 


a!” +1 asn—oo 
and 
n/™ +1 asn— oo. 


Hence 


(vz) +1 as n — oo 


and 


ni/n +1 asn—- oo. 


Solutions to exercises 


It follows that 


n” 1/n 
( ) ~e an oO; 


n! 


that is, 
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Unit F4 
Power series 


Introduction 


The evaluation of functions is of great importance. If you are dealing with 
a polynomial function, then the calculation of its values is just a matter of 
arithmetic. For example, if 


iiaj=14 $x lx? tr’ | tat, 


then 
f 1 1 1 1 13 


On the other hand, the sine function is different. There is no way of 
calculating most of its values exactly just by the use of arithmetic, but it is 
important to be able to estimate them accurately because they arise in the 
solutions of many practical problems. 


This unit is concerned with a procedure for calculating approximate values 
of functions, like the sine function, which cannot be found exactly. In 
studying this procedure, you will use many of the ideas and results you met 
in earlier analysis units, especially those relating to sequences and series. 


You will see that a certain sequence of polynomials, known as Taylor 
polynomials, can be used to calculate approximate values of functions to 
any desired degree of accuracy, and that many functions can be 
represented as a sum of a convergent series of powers of x, known as a 
Taylor series. For example, the polynomial p(x) = z — «3/6 approximates 
f(x) = sing to within 5 x 1076 for all x in the interval [0,0.1], and we can 
represent the sine function by the following convergent series: 


z3 rž x’ 


sing=e-—ate oat for x ER. 
As an application of these ideas, we show later in the unit how a method 
based on Taylor series can be used to estimate 7 to any desired number of 


decimal places. 


1 Taylor polynomials 


In this section you will meet the definition of the Taylor polynomial Ta (zx) 
at a point a of a function f, and study several particular functions for 
which Taylor polynomials appear to provide good approximations. 


1.1 What are Taylor polynomials? 


Let f be a function defined on an open interval J. Throughout this unit, 
we assume that a is a particular point of J and seek polynomial functions 
which provide good approximations to f near the point a. 


If f is continuous at a, then the value f(a) is an approximation to the 
value of f(a) when z is near a, by the definition of continuity; that is, 


f(x)» f(a), for x near a. 


1 Taylor polynomials 
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Figure 1 Approximating f(x) 
by f(a) 


n y= fla) + f'(a)(x— a) 


Figure 2 The tangent 
approximation at a to f 
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In geometric terms, this means that we can approximate the graph 

y = f(x) near a by the horizontal line y = f(a) through the point (a, f(a)), 
as shown in Figure 1. Usually this does not give a very good 
approximation. 


However, if the function f is differentiable on J, then we can obtain what 
is usually a better approximation by using the tangent at (a, f(a)) instead 
of the horizontal line, as shown in Figure 2. We can think of the tangent 
at (a, f(a)) as the line of best approximation to the graph near a. The 
tangent to the graph at (a, f(a)) has equation 


y- f(a) _ 


TtT—a 


f'(a), thatis, y= f(a) + f'(a)(x-— a). 
So, for x near a, we can write 
f(x) = f(a) + f'(a) (z — a). 
This approximation is called the tangent approximation at a to f. 
Note that the function f and the approximating linear function 
z f(a) + f'(a)(z — a) 


have the same value at a and the same first derivative at a (that is, their 
graphs have the same gradient at a), so this gives a better approximation 
to f near a than the constant function f(a) when the gradient of the graph 
is non-zero. 


Worked Exercise F40 


Determine the tangent approximation to the function f(x) = e” at the 
point 0. 


Exercise F55 


Determine the tangent approximation to each of the following functions f 
at the given point a. 


(a) fej=e,, @=2. (b) f(x)= cosx, a=: 


So far we have seen two approximations to f(x) for x near a: 
f(x) = f(a) (a constant function), 
f(x) = f(a) + f'(a)(x—a) (a linear function). 


If the function f is twice differentiable on J, then we can consider the 
quadratic function 


p(x) = f(a) + f'(a)(z - a) + 5 f"(a)(@ — a)’, 
which is chosen to satisfy p(a) = f(a), p'(a) = f'(a) and p’(a) = f" (a), as 
you can check by differentiating p to give 

p(x) = f'(a) + f"(a)(a@—a), and p(x) = f"(a). 


It is plausible that, for x near a, 
f(x) © f(a) + Fe- a) + 5f"(a)(@ — a)? 


will usually be a better approximation to f near a than the constant or 
linear approximations. More generally, if the function f is n-times 
differentiable on J, then we can find a polynomial of degree n whose value 
at a and first n derivatives at a are equal to those of f, and this 
polynomial will usually provide an even better approximation. 


Definition 
Let f be n-times differentiable on an open interval containing the 


point a. Then the Taylor polynomial of degree n at a for f is 
the polynomial 


Mag (n) 
Talg) = fla) + Faye —a) + FO @ — a)? +--+ 1 ea 
that is, 
n 2h) (q 
Le = f a Lee : 
k=0 
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Remarks 


1. Of course, the Taylor polynomial depends on the point a and the 
function f as well as on n and x, but for brevity our notation T;,(z) 
does not reflect this. Usually a and f will be clear from the context. 


2. The coefficients in the definition of T, have been chosen so that 


Lea=fe, Meera, an a= a 


Thus the functions f and Th have the same value at a and have equal 
derivatives at a for all orders up to and including n, in the same way as 
for the linear and quadratic functions above. Indeed, To(x), Ti (£) 

and T(z) are respectively the constant, linear and quadratic 
approximations at a to f discussed previously. 


3. It follows from the definition that Taylor polynomials for successive 
values of n satisfy the recurrence relation 


f(a) 


TAG) = Tn—1(2) + = 


(zx-a), n>1. 


4. A Taylor polynomial has degree n if it is based on derivatives of f up to 
order n. This use of the word ‘degree’ differs from the usual definition 
of the degree of a polynomial, which is the largest exponent in the 
polynomial. 


5. Some texts refer to a Taylor polynomial about a instead of at a. 


Worked Exercise F41 


Determine the Taylor polynomials Tı (x), T(x) and T3(x) at the following 
points a for the function f(x) = sin z. 


(a) a=0 (b) a=7r/2 
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(a) Hence, at a=0, 
Ti(z) = f(0) + f'(0)z = x 
To(x) = T + £0) 2 = 


®. Note that Tə(x), the Taylor polynomial of degree 2 at 0, is a 
polynomial of degree 1 because f”(0) = 0. .® 


(b) At a=7/2 we have 
Ti(0) = f(n/2) + f(t/2\(a — 7/2) =1 


mo =Ty(2) + EP e ra 
=i o 7/2)? 
ir 

Tala) = Tola) + P e — 1/2) 


®@. We do not usually multiply out brackets in such Taylor 
polynomials, since that would make the results less clear. © 


Exercise F56 


Determine the Taylor polynomials Tı (x), Tə(x) and T3(a) for each of the 
following functions f at the given point a. 


al J(=; 2=—2. (b) JQ) =cose, c=. 


Exercise F57 


Determine the Taylor polynomial of degree 4 for each of the following 
functions f at the given point a. 


(a) f(x) =log(l+z), a=0. (b) f(x)=singz, a= 7/4. 


(c) f(x) =14+ $2 — $2? - $r? + irt, a=0. 


Exercise F58 


Let Tz be the Taylor polynomial of degree 3 at 0 for f(x) = sin x, as 
calculated in Worked Exercise F41(a). Use a calculator to show that 


| sin(0.1) — 73(0.1)| < 1 x 1077. 


(Remember to set the calculator to use angles in radians.) 
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1.2 Approximation by Taylor polynomials 


We now look at two specific functions in order to investigate the assertion 
that Taylor polynomials provide good approximations for a large class of 
functions. 


The function f(x) = sin x 


In Worked Exercise F41(a) we found the Taylor polynomials Tı (x), T(x) 
and 73(a) for the function f(x) = sinz at the point a = 0. By calculating 
higher derivatives of f at 0, we can show that the Taylor polynomials of 
degrees 1,2,...,8 at 0 for f are as follows. 


ZL 
T(x) = T(z) = 2, T(x) = Ty(z) = x ar 
z’ gř LB x rx 
Tasman P AR E 


The graphs in Figure 3 illustrate how the approximation to f(x) given by 
Ta (x) improves as n increases. 


Yr g= Ta) YR 


Figure 3 The graphs of Taylor polynomials at 0 for f(x) = sin z 


For example, the graph of Ts appears to be very close to the graph of the 
sine function over the interval (—7/2,7/2), so T(x) seems to be a good 
approximation to sin x in this interval. 


It also appears that, as the degree of the Taylor polynomial increases, the 
interval over which its graph is a good approximation to that of the sine 
function becomes longer. For instance, in the graphs in Figure 3, the 
shaded area covers the interval of the z-axis on which the Taylor 
polynomial Tp(x) agrees with sin x to three decimal places. 


Worked Exercise F42 


Determine the Taylor polynomial of degree n at 0 for the function 
f(z) = sinz. 


Solution 
We have 


f%@)=sinze, a 
and in general, for k = 0, 1,2,..., 
fono =o and o= i 


Hece, tor m = 0 l 2y 


3 j g2kt+l 
Tomy (2) = > (- i 
R (2k +1) 
r3 x’ emt 
Ey yay ed ope meee leg E eee 
aa a.) a 


and OT) = Torpa): 
So if n takes either of the values 2m + 1 or 2m + 2, then we have 


r3 x5 g2mrl 
Taye zoa (-1)™ 
ES 7 vale) Cm +1)! 
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The function f(x) = 1/(1 — x) 
By repeated differentiation of f(x) = 1/(1 — x), we can verify that 


k! 


Thus in particular f (k) (0) = k!, so the Taylor polynomial of degree n at 0 
for f is 


n fik) 
Tia = > J w of 
k=0 


k 


n 
=X st =1+r+? + +a". 
k=0 
Figure 4 shows the graphs of the Taylor polynomials of degrees 1, 2, 4 
and 7 at 0 for f. 


The graphs show that the nature of the approximation is very different 
from that in the previous example. For the sine function, the interval over 
which the approximation is good seems to expand indefinitely as the degree 
of the polynomial increases. For f(x) = 1/(1 — x), however, the interval of 
good approximation always seems to be contained in the interval (—1, 1). 


For this function f, the Taylor polynomials T;,(x) at 0 are the nth partial 
Co 
sums of the geometric series > x”. This series converges with sum 


n=0 
1/(1 — x) for |x| < 1, and diverges for |x| > 1, as you saw in Theorem D24 
in Unit D3 Series. Thus, if |x| < 1, then 


Talx) > f(x) as n > œ. 


So in this example we can prove that the polynomials T(x) provide better 
and better approximations to f(x) as n increases, but only if |x| < 1. For 
|x| > 1, the sequence (T;,(2)) does not converge, so increasing the value of 
n does not in general give a better approximation to f(z). 


y = T(x) 


y = T(x) 


Figure 4 The graphs of Taylor polynomials at 0 for f(x) = 1/(1 — x) 


In later sections we often need general formulas for certain key Taylor 
polynomials at the point a = 0; we ask you to find these in the next 
exercise. 


Exercise F59 


Determine the Taylor polynomial of degree n at 0 for each of the following 
functions. 


(a) f(z)=e (b) f(@æ)=log +s) (c) f(a) = cosx 
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r>a 
a C T 


Figure 5 The points a, x and 
c in Taylor’s Theorem 
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2  Taylor’s Theorem 


In this section you will investigate how closely the Taylor polynomials of a 
function f approximate f and see that, for many functions, f has a 
representation at the point x = a of the form 


n—> o0 


ie) => lim Talr) = ip an(z — a)”. 
n=0 


We then say that f is the sum function of a power series. 


2.1 Taylor’s Theorem 


In Section 1 we showed how to find the Taylor polynomial Ta (£) of 

degree n at the point a for a function f. This polynomial and its first n 
derivatives agree with f and its first n derivatives at a, and for larger 
values of n the polynomial appears to approximate f at points near a. The 
following fundamental result gives a formula for the error involved in this 
approximation. 


Theorem F63 Taylor’s Theorem 


Let the function f be (n + 1)-times differentiable on an open interval 
containing the points a and x. Then 


(n) 


F(a) = f(a) +O- a) t FID e — a)” + Rala) 
= Ta (z) + Ra(z), 


where T,,(x) is the Taylor polynomial of degree n at a for f and 
fo) (c) 
(se) 


for some c between a and z. 


R(x) = Ce Oa 


Remarks 


1. Figure 5 shows the possible relative positions of the points a, x and c. 


2. Since Taylor’s Theorem can be expressed in the form 
f(x) = T(x) + Rn(z), 


we say that R(x) is a remainder term, or error term, involved in 
approximating f(x) by T;,(a). The formula for R,,(x) involves an 
‘unknown number’ c, so it does not specify the remainder term R,,(zx) 
exactly. Nevertheless, we can often use it to show that T,,(x) is a good 
approximation to f(x). Note that, as with the notation T,,(x), our 
notation R,,(x) does not reflect the fact that the remainder also 
depends on both f and a. 


3. When n = 0, Taylor’s Theorem reduces to 
f(z) = f(a) + F’ (c)(2 — a), 


for some c between a and zx; that is, 


Fe) = F@) _ px) 


z-a 
for some c between a and x. But this is just the Mean Value Theorem 
that you met in Subsection 4.1 of Unit F2 Differentiation (see 

Figure 6). Thus Taylor’s Theorem can be considered as a generalisation 
of the Mean Value Theorem. 


4. The form of the remainder R,(x) given in Taylor’s Theorem is actually 
due to Lagrange. There are other forms, due to Taylor and Cauchy, and 
also the following neat formula which can be derived by repeated 
integration by parts (we do not prove this here): 


Reale) = 5 f(r see a 


Note that this form of the remainder does not involve any ‘unknown 
numbers’. 


Brook Taylor (1685-1731) was an English mathematician who in 1715 
published a slim volume entitled Methodus incrementorum (Method 
of Increments) which included the theorem that now bears his name. 
Taylor was the first to publish the theorem but he was not the first to 
discover it. At least five mathematicians anticipated him: 

James Gregory (1671), Leibniz (1670s), Newton (1691), 

Johann Bernoulli (1694) and de Moivre (1708). However, Taylor was 
the first to have appreciated the fundamental significance of the result. 


The first explicit expression for the remainder term in Taylor’s 
theorem was provided by Joseph-Louis Lagrange in 1797 in his 
Théorie des fonctions analytiques (Theory of analytic functions), the 
text in which he attempted to provide a sound foundation for calculus 
by reducing it to algebra and developing it on the basis of Taylor’s 
Theorem. 


Taylor was an accomplished musician and artist, and in his book on 
linear perspective, which was also first published in 1715, with an 
improved version in 1719, he enriched the theory of perspective in 
many respects. 


In Subsection 4.1 of Unit F2 we proved the Mean Value Theorem using 
Rolle’s Theorem, which you also met in Unit F2. This says that, if f is 
continuous on the closed interval [a,b] and differentiable on (a,b), and if 
f(a) = f(b), then there is a point c with a < c < b such that f’(c) = 0. We 
now use Rolle’s Theorem to prove Taylor’s Theorem. If you are short of 
time, then you may prefer to skim read this proof. 
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Figure 6 The Mean Value 
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Proof of Taylor’s Theorem In the proof we assume that x > a; the 
proof for x < a is similar. 


We consider the function 


h(t) = f(t) — Talt) — A(t -a)"™, (1) 


where Tn is the Taylor polynomial of degree n at a for f, and Aisa 
constant chosen so that 


h(x) = 0. (2) 
Now, by the definition of Ty, 

To= TOS or {"@=T Oe, 
so 

h(a)=0, hi(a)=0, ..., h(a) =0. 


Thus the function A is continuous and differentiable on an open interval 
containing a and x, and h(a) = 0 = h(x). So, by Rolle’s Theorem applied 
to h on the interval [a,x], there is a number cı between a and x for which 


h'(c1) = 0. 


Similarly, the function h’ is continuous and differentiable on an open 
interval containing a and c1, and h’(a) = 0 = h’(c,). Hence, by Rolle’s 
Theorem applied to h’ on the interval |a, ci], there is a number c2 between 
a and cı for which 


h” (c2) = 0. 
Applying Rolle’s Theorem successively to the functions 
WWD akg BO, 
on the intervals 
[a, ca], [a,c3], ---, [a, Cn], where c2 > c3 >--- > cn >a, 
we deduce that there is a number c between a and cp for which 
A+) (ce) = 0. (3) 


These points are illustrated in Figure 7. 


a Cc Ch ee C2 C1 p 


Figure 7 ‘The points a, c, c1, C2, ..., Cn and x 
By repeatedly differentiating equation (1), we obtain 
hO+D (a) = fOD) — A(n +1)! (4) 


®. Note that MO = 0 since Tn is a polynomial of degree at most n 
and differentiating such a polynomial n times gives a constant. ©& 


From equations (3) and (4), we deduce that 


(n+1) 
(n+) (e) — = _ fe") 
f (c)— A(n+1)!=0, so A (+i) (5) 
Finally, it follows from equations (1), (2) and (5) that 
(n+1) 
_ = n+1 __ f (c) = n+1 
f(a) = Tala) + Ala = a)" = Tala) + ea, 
as required. | 


Exercise F60 


By applying Taylor’s Theorem with n = 3 to the function f(x) = cos at 
a = 0, prove that, for x Æ 0, 
B ia, COSC 4 
cosx = 1 — 52° + 7I Ha 


where c lies between 0 and z. 


(The Taylor polynomial of degree n at 0 for cosx was found in 
Exercise F59(c)). 


The conclusion of Exercise F60 can be restated as 
COSC 4 
ay? 
where c lies between 0 and x. Here we do not know the exact value of c, 
but we do know that |cosc| < 1. Thus we can deduce that 


cosg — (1-327) = 


__ |cose| |x|? 
© 4 4l 
In this way, we obtain an explicit remainder estimate, or error bound, 
for the approximation of cosx by 1 — $x", which is small when zx is near 0. 


| cos x — (1 — $2”)| |z| < 


In general, we can obtain such an estimate for | f(x) — Ta (x)| = |Rn (x)| 
provided that we have an estimate for |f(+!)(c)| which is valid for all c 
between a and x. The following strategy sets out this process. 


Strategy F11 


To show that the Taylor polynomial T, at a for f approximates f toa 
certain accuracy at a point x Æ a, do the following. 


1. Obtain a formula for f(t, 
2. Determine a number M such that 


[FFD (c) < M, for all c between a and x. 


3. Write down and simplify the remainder estimate 


We) = Tn(x)| = |Rn(x)| < jie — al". 


M 
(n + 1) 
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Note that in step 2 we can use any convenient value for M, preferably not 
too large. Sometimes we can determine the maximum value of | f("+)) (c)| 
for c in the closed interval with endpoints a and z, and often this 
maximum value is taken when c is equal to either a or x. Usually, however, 
any ‘good enough’ upper bound for | f+ (c)| will do. 


Worked Exercise F43 


(a) Write down the Taylor polynomial T3(x) at a = 0 for f(x) = sin z. 
(b) Use Taylor’s Theorem to show that |sin(0.1) — 73(0.1)| < 5 x 1076. 


(c) Hence calculate sin(0.1) to four decimal places. 


Solution 
(a) For f(x) = sinz and a = 0, we have 


T(x) = £ — za. 


®. This expression for T3(x) was derived in Worked 
Exercise F41(a). @ 
(b) We use Strategy F11 with a = 0, z = 0.1 and n = 3. 
1. First, f® (z) = sina. 
2. Thus 
|f(c)| =|sine] <1, for ce [0,0.]], 
so we can take M = 1. 
®. With care a smaller value for M can be obtained. For 


example, using the Sine Inequality (Theorem D45 in Unit D4 
Continuity), we have 


|sinc| < |e] < 0.1, for ce [0,0.1]. 
However, here we take M = 1. © 


M 
3. Using the remainder estimate m+ x —a|"*!, we therefore 
n ! 
obtain 


|sin(0.1) — T3(0.1)| = |R3(0.1)| 


eae 
CESI 


1 4 
= 0.000 004 16 
= On 


as required. 
(c) By part (a), 
T3(0.1) = 0.1 — % x 107° 
= 0.1 — 0.000 166 66... = 0.099 833 33.... 


Exercise F61 


(a) Write down the Taylor polynomial 7T>(x) at a = 0 for 
f(x) = log(1+ x), using the solution to Exercise F59(b). 


(b) Use Taylor’s Theorem to show that |log(1.02) — T2(0.02)| < 3 x 10~®. 


(c) Hence calculate log(1.02) to four decimal places. 


Our next strategy shows how to use Taylor’s Theorem to obtain an 
approximation to f which holds at all points of an interval. 


Strategy F12 


To show that the Taylor polynomial T, at a for f approximates f toa 
certain accuracy throughout an interval J of the form [a,a + r], 
[a — r,a] or [a — r,a +r], where r > 0, do the following. 


1. Obtain a formula for f°), 
2. Determine a number M such that 
[FY (c)| <M, for allc el. 
3. Write down and simplify the remainder estimate 
M n+l 
T 
(n+1)! 


ae) ee) | ean le , ior alll ge € J. 


Strategy F12 is obtained from Strategy F11 by replacing |x — a| by r, since 
|x — a| < r for all values of x in the interval J. 


By applying Strategy F12 to the situation in Worked Exercise F43, we can 
show that the remainder estimate at the point x = 0.1 in fact holds over 
the whole interval [0, 0.1]. Here we have r = 0.1, M = 1 and n = 3, so 


1 
[sins — Ta(x)| < z 0.14 < 5x107, forall x € [0,0.1]. 


Here is another worked exercise. 
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Worked Exercise F44 


(a) Calculate the Taylor polynomial T3(x) at 1 for f(x) = 1/(# +2). 


(b) Show that 73(x) approximates f(x) with an error less than 5 x 1073 
on the interval [1, 2]. 


Solution 
(a) For this function, 
f(z) = 1/( + 2), PO) = 1a 
P=- P= 
f" (z) =2/( +2),  f"(1) = 2/27 
P= ee E E -2/27. 
Hence the Taylor polynomial of degree 3 at 1 for f is 
T3(x) =} -3(£-1)+ (z - 1? -— a(s- 1). 
(b) We use Strategy F12 with I = [1,2],a=1,r=1andn=3. 


24 
1 Fi OC) = —— 
ish; f° (a) (e+ 25 
2- Movs 
24 24 
Ol 2 ee a: 1,2 


so we can take M = 24/3°. 
@. Here we have used the fact that, since c > 1, we have 
e+2>142=3.@ 


n+l 


3. Using the remainder estimate , we obtain 


|f(2) — 13(x)| = |R (x)| 
M 3+1 

Con t 

1 24 

~ a * 35 


1 
= z5 = 0.0041..., for x € [1,2]. 


ms 


voile 


Thus 73(x) approximates f(x) with an error less than 5 x 1073 
om ile A 


Exercise F62 


(a) Calculate the Taylor polynomial T4(x) at m for the function 
f(z) = cose. 


(b) Show that T(x) approximates f(x) with an error less than 3 x 1073 
on the interval [3r /4, 57/4]. 
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2.2 Taylor series 


From Taylor’s Theorem we know that if a function f can be differentiated 
as often as we want on an open interval containing the points a and x, then 


f(x) = Ta (£) + Ry(z) gy tll ea (x) 
for n = 0,1,2,..., where 
Rala) = fOFD(c) (æa 
me O (n41)! ? 


for some c between a and x. So if the error terms Rp(x) tend to zero, then 
the Taylor polynomials T;,(a) tend to f(x) as n tends to infinity, and we 
can write f as an infinite series. Thus we have the following result. 


Theorem F64 


Let f have derivatives of all orders on an open interval containing the 
points a and x. If 


Rn(z) > 0 as n > oo, 


then 


Note that, for x = a and n = 0, the series in the statement of Theorem F64 
involves the expression 0°. By convention, we take 0° = 1 in such series. 


It follows from Theorem F64 that, if Ra(x) —> 0 as n —> oo, then we can 
express f(x) as a series whose terms involve powers of (x — a). 


Definition 
Let f have derivatives of all orders at the point a. The Taylor series 
at a for f is 


f" (a) 
2! 


© (n)(g 
5 Ee a? = Fels Pale—a) 4 Me aes 


If x is a point for which the Taylor series for f has the sum f(x) given in 
the statement of Theorem F64 (that is, when R,(x) + 0 as n > oo), then 
we say that the Taylor series is valid at the point x. Any set of values of x 
for which a Taylor series is valid is called a range of validity for the 
Taylor series. On any such range of validity, the function f is the sum 
function of the Taylor series. 


We can use Theorem F64 to obtain the following basic Taylor series. In 
each case we have indicated the largest possible range of validity. 
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Theorem F65 Basic Taylor series at 0 


1 0O 
Ee t=)", for lel <1. 
=T n—=0 
l 2 ee co (=1)e yee nt 
(b) ee for « E R. 
gc? gt = (Paes 
(c) OE a for x E€ R. 
o=] 
2 ES api 
(d) seler t=) To for x e€ R 
n—0 
mo = (eee 
m= 
Remarks 


1. Taking x = 1 in Theorem F65(e), we obtain the unexpected sum 


os (—* 
SA =1-4+4- 4+ = log. 
n=l A 
This series is known as the alternating harmonic series. 


2. In Subsection 4.1 of Unit D3 we defined the exponential function as 


Theorem F65(d) shows that e” is the sum function of the series 
co 


5 x” /n! for all x, not just for x > 0. Note that some texts define 
n=0 
e” = exp(x) using this power series. 


3. In this module, sin x and cosx are defined in terms of a right-angled 
triangle. Theorem F65 shows that sin x and cosx can be represented by 
power series for all x € R, and some texts use these series to define sin z 
and cosx in a way that does not depend on geometric ideas. 


Proof of Theorem F65 


(a) Let f(x) = 1/(1— x). You saw in Subsection 1.2 that the Taylor 
polynomial of degree n at 0 for f is 


n 
Tal) =1+r+ +a =Y a. 
k=0 
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Now 1+a2+27+--- is a geometric series with initial term 1 and 
common ratio x, which has sum 1/(1 — x) for |x| < 1, as you saw in 
Theorem D24 in Unit D3. Thus the result follows. 


Let f(x) =sinz. You saw in Worked Exercise F42 that the Taylor 
polynomial of degree n at 0 for f is 
3 5 mM »2m+1 m kp 2k+1 
By xv Le =I)" 
Peat ee ey a 
3! 5! (2m + 1)! se (2k + 1)! 
where n = 2m + 1 or n = 2m+2. 
By Taylor’s Theorem, we have 
FEV) n+1 


sinz = Trla) + Rn(z), where R(x) = (n+ 1)! = 7 


for some c between 0 and x. Since 


ftd (x) = +sin g or + cos 7, 


we have | f("+)(c)| < 1, so 
ak 


Gam oe 
n : 


|Rn(a)| < 


®. Here we have used the Squeeze Rule for null sequences and the 
fact that (a"/n!) is a basic null sequence; see Subsection 2.3 of 
Unit D2 Sequences. & 


Hence the result follows. 


The proof of part (c) is similar to that of part (b), so we omit the 
details. 


Let f(x) = e”. You saw in the solution to Exercise F59(a) that the 
Taylor polynomial of degree n at 0 for f is 


z2 x” U ak 
=0 


By Taylor’s Theorem, we have 
e” = Te) + Ryle), 
where 
ZO) n+1 g an 


=e — 


Role) = (n+1)!’ 


for some c between 0 and x. Now e° < e!*! and (a"*1/(n + 1)!) is a 
null sequence. 


®. The value of c depends on n, but it always lies between 0 
and z. @ 


Hence R(x) + 0 as n > ov, by the Squeeze Rule for null sequences, 
so the result follows. 
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(e) 


3 


Let f(x) = log(1 +x). You saw in the solution to Exercise F59(b) 
that the Taylor polynomial of degree n at 0 for f for n > 1 is 
2 3 =i n+1,.n n = k+1_k 
Sie ee g a 
2 3 n = k 


By Taylor’s Theorem, we have 


or) (c) n+l 


log(1 + x) = Ta(x) + Rn(x), where R,(x) = E ae 
for some c between 0 and z. 


Now suppose that 0 < x < 1. Since, by the solution to 
Exercise F59(b), 


forte) = (—1)"*2n! 
(I+ ayn 
we have 
= 1 n! n+l 
|Rn(z)| = me x ETE |z| 
[aie 


1 
S e > oo. 
mEt Én as n — œ 


Hence 
[0.6] 
(=i) iz” 
log(1 = —£_—., fr0<r<l. 
og(1 + 2) À = or0<2“< 


®. Here we have used the fact that 0 < c < x < 1, so 
erei and 1+e>1, 


We can include x = 0 in the range of validity because both log(1 + x) 
and the sum function are zero when g = 0. ® 


The proof that this Taylor series is also valid for —1 < x < 0 does not 
follow from the above form of R,,(x); we ask you to prove it later, in 
Exercise F72. | 


Convergence of power series 


So far in this unit you have seen how a function f can often be 
approximated near a point a by means of a Taylor series, which is an 
infinite sum of powers of (x — a). In this section you will study the 
behaviour of such power series in their own right, and consider functions 
which are defined by power series. 


3 Convergence of power series 


3.1 Radius of convergence 


We begin with a formal definition of a power series. 


Definitions 


Let a € R, x € Rand a, € R, n=0,1,2,.... Then the expression 
OO 
X an(x — a)” = ao + a (£ — a) + a2(x — a)? +--+- 
n=O) 


is called a power series at a in x, with coefficients an. We call a 
the centre of the power series. 


Notice that in the definition of a power series, we think of a as a constant 
and x as a variable. 


In Section 2 you saw that certain standard functions can be expressed as 
the sum functions of their Taylor series; for example, 


oO 
= ya for |z| < 1, 
n=0 


and 
o0 1) tig n 
log(1 + x) = p , for —1<zx<1, 
n=] 
and both these series are power series at 0 in x. All Taylor series are 
examples of power series. 


On the other hand, we can consider power series in their own right and use 
these to define functions. For example, you saw in Subsection 2.2 that we 
could have defined the exponential function by the formula 


X g” 
Dag — (xER). 
= n! 
Another example is the Bessel function 
[0.6] 
o (1)"(@/2)?" 
n=0 
which arises in connection with the vibration of a circular drum. diverges converges diverges 
In each of the above examples (where a = 0), the power series converges (ee 
an interval with centre a. The next result shows that this property is true a-Ra a+R 


for all power series; it is illustrated in Figure 8. We give the proof of this Figure 8 The radius of 
result in Subsection 3.2. convergence 
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Theorem F66 Radius of Convergence Theorem 


For a given power series Ss an(x — a)", exactly one of the following 


possibilities occurs. vo 
(a) The series converges only for x = a. 
(b) The series converges for all x. 


(c) There is a number R > 0 such that 
co 
` an(x — a)” converges if |x — a| < R 
n—0 
and 


(oe) 

ťa an(x — a)” diverges if |x — a| > R. 

n=) 
Moreover, in parts (a), (b) and (c) the series converges absolutely on 
the specified sets of convergence. 


Although it is usually sufficient to know that a series is convergent, the 
stronger result about absolute convergence is sometimes useful (for 
example, we will use it when we prove the Differentiation Rule in 
Subsection 4.2). Remember that a series X` an converges absolutely if the 
series J` |an| converges. We showed in Theorem D34 in Unit D3 that every 
absolutely convergent series is convergent. 


You have already met the following examples of the three possibilities in 
Theorem F66 (see Theorem F65(d) for (b) and Unit D3 for (a) and (c)): 
OO 
(a) > n! a” converges only for x = 0 
n=0 


co n 


(b) ‘2 L converges for all x 


n=0 


OO 
(c) yo converges if |x| < 1 and diverges if |z| > 1, so R = 1. 

n=0 
The positive number R in Theorem F66(c) is called the radius of 
convergence of the power series because the power series converges at 
those points whose distance from the centre a is less than R, and diverges 
at those points whose distance from a is greater than R. (Power series can 
also be defined for complex variables, in which case the points whose 
distance from the centre is less than R form a disc of radius R.) We extend 
the definition of the radius of convergence to the cases of Theorem F66(a) 
and (b) by writing 


3 Convergence of power series 


R = 0 if the power series converges only for « = a 
and 
R= œ if the power series converges for all x. 


In this last case R is used as a symbol, not a real number. 


Theorem F66(c) makes no assertion about the behaviour of the power 
series at the endpoints of the interval (a — R,a + R); in fact, a power series 
may converge at both endpoints, neither endpoint or exactly one endpoint, 
as you will see in Worked Exercise F46. 


The interval of convergence of the power series is the interval 
(a — R,a+ R), together with any endpoints of this interval at which the 
power series converges. 


Figure 9 illustrates the various possible types of interval of convergence of 


y anlx — a)”. 
n=0 


d eT ee 
a R a-R a a+R 
I i Se 
a-R a atR a-R a a+R a-R a at+R 


Figure 9 Possible types of intervals of convergence 


Theorem F66 tells us that each power series has a radius of convergence R, 
but it does not tell us how to find R. However, a power series is a 
particular type of series, so the convergence tests for series from Unit D3 
can be applied. 


We can find the radius of convergence of many power series by using the 
following version of the Ratio Test for series from Subsection 2.1 of 
Unit D3. 


Theorem F67 Ratio Test for power series 
[0.0] 

Suppose that ` an(x — a)” is a power series with radius of 
n=0 

convergence R, and that 


An+1 
an 


> L as n> œ. 


(a) If L is œ, then R = 0. 
(o) T L = 0, tinen e = ea: 
(e) me L > O, tnem a = iib 
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Proof We give this proof only in the case that a = 0. The proof of the 
general case is similar. It follows from the statement about absolute 


convergence in Theorem F66 that it is sufficient to consider the 
OO 


convergence of the series 5 |a,x"| as this is convergent precisely when 

co n=0 

pS ang” is convergent. This enables us to base the proof on the Ratio Test 
n=0 


for series which can only be applied to a series of positive terms. 


(a) Suppose that 


an+1 
——| > oo as n > œ. 
an 
Then, for « Æ 0, 
1 
lange] Jaai 
-~ = |x| + œ as n > oœ, 
|anx”| n 
co 
so ) |anx"| is divergent, by the Ratio Test for series. 
n=0 
Co 


Thus the series pe anz” converges only for x = 0, so R = 0. 
n=0 
(b) Now suppose that 
An+1 
an 
Then, for x Æ 0, 
[an1] 
|anx”| 


—> 0 as n > oo. 


An+1 


|z| => 0 x |z| = 0 < 1 as n > oœ, 


[0.6] 
so anx"| is convergent, by the Ratio Test for series. 
> gent, by 


n=0 
oo 


Thus the series `> anz” converges for all x € R, so R = œo. 
n=0 
(c) Finally, suppose that 
An+1 
an 


> Las n> oœ, 


where L > 0. 
If |z| > 1/L, then 


[apace | 


|anx”| 


Qn+1 


|z| + L|z| > 1 as n > oo, 


co 
so `> janz”]| is divergent, by the Ratio Test for series. Thus if 
n=0 
(oe) 
|x| > L, then the series Ds a,x" is not absolutely convergent and 


n=0 
hence is not convergent, so it follows that R < 1/L. 
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However, if 0 < |x| < 1/L, then 


lenge" | = An+1 


|z| > L|z|<lasn>o@, 


|anz”| n 


(oe) 
so > |anx”| is convergent, by the Ratio Test for series. Thus if 
n=0 oo 
|x| < L, then the series LD anz” is absolutely convergent and hence, 
n=0 
by Theorem F66, is convergent, from which it follows that R > 1/L. 


Taken together, these results show that R = 1/L, which completes 
the proof. E 


Worked Exercise F45 


Determine the radius of convergence of each of the following power series. 


(a) py eae (b) Da 
n=0 : n=0 


n! 


Exercise F603 


Determine the radius of convergence of each of the following power series. 


(a) So(a"+4")2" b SS Car (c) X n+2”)(z- 1)" 
n=0 n=1 n=0 


(d) - aa” 
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The next exercise concerns a power series which plays an important role in 
Section 4, where we will use it to prove a generalised binomial theorem. In 
this power series a can be any real number, but if a € {0,1,2,...} then the 
series has only finitely many non-zero terms and thus converges for all 
values of x. The exercise asks you to determine the radius of convergence 
of the power series for other values of a. 


Exercise F64 


Determine the radius of convergence of the power series 


a(a— 1 al(a—1)---(a-n+1 
BI i, EAE, 


where a £0,1,2,.... 


ltaxr+ 


The Ratio Test gives an open interval on which a power series converges. 
To determine the full interval of convergence of a power series with finite 
non-zero radius of convergence, we need to use other tests to find the 
behaviour at the interval endpoints. 


Strategy F13 
[0.0] 
To find the interval of convergence of the power series D anlx — a)”, 


n=0 


do the following. 


1. Use the Ratio Test for power series to find the radius of 
convergence R. 


2. If R is finite and non-zero, use other tests for series to determine 
the behaviour of the power series at the endpoints of the interval 
(a—R,a+R). 


You met a general strategy for applying the various tests for convergence 
or divergence of a series in Subsection 3.3 of Unit D3 (Strategy D13, which 
you can also find in the module Handbook). You may find it helpful to 
refer to this general strategy when applying step 2 of Strategy F13. 


Note in particular that we can use Strategy F13 to establish the largest 
possible range of validity of a Taylor series. Since a Taylor series is a power 
series, its largest possible range of validity is the interval of convergence 
obtained by using Strategy F13. 


In the following worked exercise and exercise, each of the power series has 
coefficient ag = 0, so the sum starts at n = 1. 


3 Convergence of power series 


Worked Exercise F46 


Determine the interval of convergence of each of the following power series. 


®ve OL oye 
n=1 n=1 n=1 


Solution 
In each case, we apply Strategy F13. 
(a) Here an —ltor t=O. 

1. Since 
An4+1 
ün 
we have R = 1, by the Ratio Test. Thus (by the Radius of 

Convergence Theorem) this power series 


= for j= n 


e converges for —1 < x < 1, 
e diverges for x > 1 and x < —1. 


co 
2. Itz = 1, then the power series is 5 1”, which is divergent by 
the Non-null Test. n=1 


o0 
If x = —1, then the power series is >S’, which is also 
divergent by the Non-null Test. "=! 

Hence the interval of convergence is (—1, 1). 

(ib) “Herea, = l/n, tor nm — 1, 2.5.3: 


1. Since 


1 n il 
x = = ——_ 
n+1 1 1+1/n 


we have R = 1, by the Ratio Test. Thus this power series 


An+1 
an 


> l as n> œo, 


e converges for —1 <a < 1, 
e diverges for x > 1 and g < —1. 


co 
1 
2. If «= 1, then the power series is > —, which is a basic 
n 


divergent series. n=l 


(0 


n 


, which is 


co 
If x = —1, then the power series is D 
m= 


convergent by the Alternating Test. 


Hence the interval of convergence is [—1, 1). 
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le) Here, = 1/(2'n*); form — 12 


1. Since 
An+1 E 1 7 In? 
an | 2"+1(n +1)2 1 
1 


1 
=~, > = = 00, 
aU a 


we have R = 2, by the Ratio Test. Since a = 3, this power 
series 


e converges for 1 < a < 5, 
e diverges for x > 5 and x < 1. 


®. This follows because 
|z -3| 2 <= -—2< 4-322 
= a 
by the rules for rearranging inequalities. © 
2. If « = 5, then the power series is 


a ik yi 
Do a 
m= n= 


which is a basic convergent series. 
If x = 1, then the power series is 


oo oo 

1 (-1)" 
Sr aa er 
n=l = 


which is convergent by the Absolute Convergence Test. 


Hence the interval of convergence is [1, 5]. 


Exercise F65 
Determine the interval of convergence of each of the following power series. 


(a) Dona") De -5p 
n=1 


n=1 


3.2 Proof of the Radius of Convergence 
Theorem (optional) 


In what follows, we often use without reference the fact that an absolutely 
convergent series is convergent. 


To prove the Radius of Convergence Theorem, we need the following 
preliminary result. 
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Lemma F68 


co 


If the power series Ss ang” converges for some xp Æ 0, then it is 


n=0 
absolutely convergent on the interval (—|zo|, |xo]). 


Figure 10 illustrates the statement of the lemma in the two possible cases 
zo > 0 and x < 0. 


ASSUME 
convergence 
at Xo 


ro > 0 toxu 
— —— 
0 0 


DEDUCE 
convergence on interval 


(—|zo], |zo]) 


Figure 10 The two cases of Lemma F68 


Proof of Lemma F68 First we write r = |xo|. Since the series 


oO 


tP anxo is convergent, the sequence (anxo) is null by Theorem D27 in 


n=0 
Unit D3, and hence there is a number K such that 
langle" = janzo| < K, for n = 0,1,2,.... (6) 
[oe 
Suppose that |z| < r. To prove that 5> anx” is absolutely convergent, we 
write n=0 
Ay M 
Ant” = anr” (=) . 
r 


Then, by inequality (6), 
a (=) l 
r r 


oo n 
Since |x| < r, we have |z|/r < 1, so the geometric series > K (=) is 
r 


lana” | = |an|r” 


n=0 


convergent. Hence, by the Comparison Test (Theorem D30 in Unit D3), 
OO 


J |a,x"| is convergent, as required. E 


n=0 


We can now give the proof of the Radius of Convergence Theorem, which 
we first state again for convenience. 
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Ò |c| a, R=supE 


Figure 11 The case when 
|z| < R 


0 R=supE 2 |z| 


Figure 12 The case when 
|x| > R 
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Theorem F66 Radius of Convergence Theorem 


For a given power series ` an(x — a)”, exactly one of the following 


possibilities occurs. =L 
(a) The series converges only for x = a. 
(b) The series converges for all x. 


(c) There is a number R > 0 such that 


[0.6] 
DE an(z — a)” converges if |x — a| < R 
=O 

and 


(oe) 

> an(x — a)” diverges if |x — a| > R. 

n=0 
Moreover, in parts (a), (b) and (c) the series converges absolutely on 
the specified sets of convergence. 


Proof We give the proof only in the case a = 0. The proof of the general 
case is similar. 


First we define the set 


[oe] 
B= fa ER: `> ang” is convergent} f 


n=0 
If E = {0}, then possibility (a) holds. 


If E is unbounded, then for every x € R there exists some zo € E such 
0O 


that |x| < |ao|. Thus the series > anz” is absolutely convergent, by 
n=0 
Lemma F68. Since this is true for every x € R, it follows that 


possibility (b) holds. 

Otherwise, the set F is bounded and contains a point zo 4 0. Then 
(—|xo|, |zo|) C E, by Lemma F68, so sup E > |zo|. We define R = sup F, 
where R is the radius of convergence. 


If |x| < R, then we can find x; € E such that |x| < x1; see Figure 11. 
(oe) 


Thus, by Lemma F68, the series 7 anx” is absolutely convergent. 
n=0 
If |x| > R, then we can find x2 > R such that |x| > x2; see Figure 12. So 
(oe) [0,6] 0O 
> ang” is divergent (since if `> anz” is convergent, then bp anT3 is 


n=0 n=0 n=0 


convergent, by Lemma F68). Hence possibility (c) holds. 


The above arguments show that in each case the given power series is not 
just convergent but absolutely convergent at each interior point of its 
interval of convergence. 


This completes the proof. | 


4 Manipulating Taylor series 


In Section 2 you saw that many functions can be represented by a Taylor 
series; if there exists R > 0 such that 


f(x) = 


n=0 
for |x — a| < R, then the expression on the right is the Taylor series at a 
for f. 


In this section you will meet several rules which enable us to obtain ‘new 
Taylor series from old’, and so build on the list of basic Taylor series given 
in Theorem F65. Some of the rules for manipulating Taylor series are 
similar to the corresponding rules for continuous or differentiable functions. 


We begin by noting that we can obtain new Taylor series from old by 
replacing x with another expression. For example, you have already seen 
the following Taylor series at 0: 


i co 
ee for |x| < 1. (7) 


This power series has radius of convergence 1. Since |—2| < 1 if and only if 
|z| < 1, we can deduce from equation (7) that 


1 


OO 
—— S15 aE —1)”z”, for |æ| <1, 
Pa r+ 2 ys or |z| 


and this power series also has radius of convergence 1. 


Similarly, since |3x?| < 1 if and only if |£] < 1/3, we can deduce from 
equation (7) that 


1 


13 = | + 3x? + (3x7)? + (31°) +.. 


OO 
= ee for |r| < 1/V3, 
n=0 


and this power series has radius of convergence 1/73. 


4 Manipulating Taylor series 


231 


Unit F4 Power series 


4.1 The Combination Rules and the Power 
Rule 


We now study the Combination Rules for Taylor series. 


Theorem F69 Combination Rules for Taylor series 


Let f and g be functions that can both be represented by a Taylor 
series at a, and suppose that 


OO 
Zs Smee — a)”, for |x — a| < R, 
n=0 


(0,6) 
gla) = ae —a)”", for |e ~ a| < R 
w=) 


Then the following hold for r = min{R, R'} and à € R: 
OO 
Sum Rule (f +g)(£) = Do + bn) (z — a)”, 
n=0 
for |x —al <r 


Multiple Rule Af(x) = `> àan(z — a)”, for |z — a| < R. 
n=0 


Remarks 


1. The Sum and Multiple Rules for Taylor series are simply special cases 
of the Sum and Multiple Rules for general convergent series (see 
Theorem D25 in Unit D3.) 


2. The radius of convergence of the Taylor series for f + g may be larger 
than r = min{R, R’}: Theorem F69 simply asserts that it must be at 
least r. For example, we can use the standard geometric series and the 
Sum and Multiple Rules to verify that the Taylor series at 0 for the 


1 
d = — 4 — 
tog an g(x) fon Toe 


functions f(x) = 
OO 

f(a) =1+r +r? +r t =F = 
n=0 


and, by replacing x with x/2, 


g(x) =—(ltata?+a°t+---)+ (1+ iet Ga? + pare +--) 


0O 
1 
sy (ai =) of 
a 2” 
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The Taylor series for each of f and g has radius of convergence 1, by the 
Ratio Test for power series, so Theorem F69 tells us that the radius of 
convergence of the Taylor series for f + g is at least 1. However, the 


1 
Taylor series for the function (f + g)(x) = 1-2/2 is 


0O 
1 
(f+ g)(a) =14 pot ja? tia t hatt =o a", 
n=0 
which has radius of convergence 2, by the Ratio Test for power series. 


Worked Exercise F47 


Find the Taylor series at 0 for f(a) = cosh z. 


Solution 


We can write 
cosha = 3(e*+e*), forrzeR. 


We know that, for x € R, 


tT] E E ge a ae 
ee or 3) ar 
®. This is one of the basic Taylor series given in Theorem F65. © 


and so, by replacing x with —2, 


PO g age 
=T = — — _ — —— RRR — ns eee 
e *=1 at oy a (—1) ae ; 
Then, by the Sum Rule, we deduce that 
2 4 2n 
rgt a E ete a ee Pee 
e+e (145454 oat T for x € R, 


since the odd-powered terms cancel. It then follows, by using the 
Multiple Rule with À = $, that for x € R, 


2 4 2n 
— hy se =r = 23 as 30% z 
cosh z = 5(e” +e Nea onary * Gn! 


Exercise F66 


Find the Taylor series at 0 for each of the following functions. 


(a) f(x) = sinh z (b) f(x) = log(1 — x) + — 
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We now look at how we might find the Taylor series at 0 for the function 
f(a) = 


l-z 


1 
We can express i ak in the form (1+ 2) x ja using the Taylor 
-g -T 


we have 
—2£ 


series for T 


1+ 1 
-= 
ee CINSI 


= (1+) x (+r +r? +r +r), for jel <1. 


If we then simply multiply out these two brackets and collect together the 
multiples of successive powers of x, we get 


1+ 


i =1x(1+r +r? tei + +H) 
-r 


+e x(1+r+r HHHH) 
= 142r 4 Oe? 42? heed De H, 


and this power series has radius of convergence 1. 


To justify multiplying together Taylor series in this way to obtain further 
Taylor series, we now state and prove the Product Rule for Taylor series. If 
you are short of time, then you may prefer to skim read the proof. 


Theorem F70 Product Rule for Taylor series 


Let f and g be functions that can both be represented by a Taylor 
series at a, and suppose that 


OO 
Ze = eae — a)”, for |x—a|< R, 
n=0 


oO 
e) = ale —a)", for |z—a| < R’. 
n=0 
Then if r = min{R, R’} we have 
co 
ae) = Sale —a)", for |z—-—a| <r, 
n=0 
where 


n 
Cn = aobn + aibn—1 + +--+ + an—1b1 + Ando = > Qo ae 
p=) 


Note that although the expression for the coefficient c, looks rather 
complicated, it is just the result of multiplying the two Taylor series term 
by term and summing all the resulting coefficients of (x — a)”. 
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Proof of Theorem F70 For simplicity, we assume that a = 0. 
Take n > 2 and put m = we the integer part of in. Then 


n n 
>, ajx? x `> bjr = S cpa” + the sum of moe terms a;b; git 
j j k=0 
in Sa! aT a) with i+j >n. 
i=0 j=0 


Thus, by the Triangle Inequality, 


sar oo gi) — Sea < the oun of thone terms laibja’ x’ +3] 
i=0 j=0 k=0 


j=0 


But all the latter terms are included in the expression 


n n m m 
X laz] x X. bja] -X Jaiz] x X. lojal, 
i=0 j=0 i=0 j=0 


and the other terms in this expression are all non-negative. 


m m 
®. Note that all of the terms |a;bjxtti| in 3 |a;x"| x > |b; | have 
i+j<2n<n. @ i=0 j=0 


Hence 


n nm m 
ie aitt x `> bjxi — 2 cya 
i=0 j=0 = 
n 


< Dol «etl So jesz 1x Deh (8) 
j=0 


i=0 


Co 
Now suppose that |x| < r, so the series 3 jajx*| and `, |b;27| are both 
i=0 j=0 
convergent, with sums s and t, respectively. 


@®. Here we have used the fact that a power series is absolutely convergent 
at each interior point of its interval of convergence. ®@ 


As n — oo, the right-hand side of inequality (8) tends to st — st = 0, since 
m — oo. Thus, by the Limit Inequality Rule (Theorem D11 in Unit D2), 
[0,6] 


) ckx" converges, and 


k=0 
29 . 
Daa aS u x Se: 
i=0 j=0 
This completes the proof of the Product Rule. | 


in Sja x Sjur withi + j >n. 
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Worked Exercise F48 


1+ 


Find the Taylor series at 0 for the function f(x) = aaa 


Solution 


We first write 
1l+a Lire, 1 
=a l-g 1-2 


We know that, for |æ| < 1, 


1 
1 De + 20? + 2a Hoe + 2a” H 
= 8b 
and 
1 
ero a 


@. We discussed the first of these two series just before stating the 
Product Rule, and our informal argument there is justified by the 
proof of the Product Rule. The second series is a basic Taylor series 
listed in Theorem F65. .©& 


Hence, by the Product Rule, 
l+a l+a 1 


(=e) "fas tz 

(1 4- 2m d Da” de Dap? de oce d ee) 

x (l+o+e?4+a°4+---+274---) 

=1+(2+1l)r+(24+24+1)2?4+--- 
+(2+2+4---+241)2"+--- 

= 1 +3 +52? +--+ (2n +1)z”+---, 


ioe e <i 


[0.6] 
Thus the Taylor series for f at 0 is X (2n a for ee 


=À) 


Exercise F607 


Determine the Taylor series at 0 for each of the following functions. 


(a) Fæ) = 0ta) b) Fe) = y 
lg 


(c) f(x) = asa 


Hint: In part (c), use the solution to Worked Exercise F48. 
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Exercise F68 


Determine the Taylor series at 0 for each of the following functions. In each 

case, indicate the general term and state a range of validity for the series. 
1 

(a) f(z) = sinh z + sin x (b) f(z) = {ioe 

Hint: In part (a), use the solution to Exercise F66(a). 


Exercise F69 


Determine the first three non-zero terms in the Taylor series at 0 for the 
function f(x) = e?(1—«)~?, and state a range of validity for the series. 


Hint: Use the solution to Exercise F67(b). 


4.2 The Differentiation and Integration 
Rules 


We have seen that the hyperbolic functions have the following Taylor series 
at 0: 
3 5 2 4 
. x x x x 
sinha =g + ar tate and cosha=1+ +7 +77: 
each with radius of convergence oo. Notice that the derivative of the 
function sinh x is cosh z, and that term-by-term differentiation of the series 
3 5 
x T : : T 
oa a gives the series 1 + 5 + 7 
can obtain the Taylor series for the derivative of a function f simply by 
differentiating the Taylor series for f itself. 


+--+. It looks as though we 


Our next two results show that we can differentiate or integrate the Taylor 
series of a function f term-by-term to obtain the Taylor series of the 


corresponding function f’ or | f, respectively, where we make an 


appropriate choice for the constant of integration. 


Theorem F71 Differentiation Rule for Taylor series 


The Taylor series 
Fe) = San(w— a)" and 9(c) = > nan(x — a)" 
=) w= 


have the same radius of convergence, R say. 


Also, f(x) is differentiable on (a — R,a + R), and 
O= for e= a< R 
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-R 0 |z| r R 
Figure 13 The relationship 
between r, |z| and R. 
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Note that the two series in the Differentiation Rule may behave differently 
at the endpoints of their respective intervals of convergence. For example, 
the power series 


1 1 
pita + aya + =o 
and 
1 il 1 1 l Si 4 


both have radius of convergence 1. However, the first series converges at 
both +1, whereas the second series converges at —1 but diverges at 1. 


Proof of the Differentiation Rule (optional) For simplicity, we 
assume that a = 0. 


(oe) OO 
Let the series J anx” and a nanz”™! have radii of convergence R 


n=0 
and R’, respectively. We prove iai R'=R. 


We first show that R’ > R. To prove this, suppose that |x| < R. Now 
OO 


choose r such that |z| < r < R, as shown in Figure 13. Then > anr” is 

n=0 
convergent, so (anr”) is a null sequence. Thus there is a positive number 
K such that 


lant |S K, for n= 0,1,2,.... (9) 
Then 
n»~n—1 n—-1 
K 
[nans !]| a da Š =n (2) > forn= l; 2e (10) 
prx i T 


[0.6] 
by inequalities (9). Since |x|/r < 1, the series Son (\2|/r)"! converges. 


n=1 
®. This follows from the solution to Exercise F65(a). .@ 
Therefore, by statement (10) and the Comparison Test (Theorem D30 in 
OO 
Unit D3), i |na,x” t| is convergent for all |z| < R. This proves that 


n=1 


R' >R. 
Next we show that R > R’. To prove this, suppose that |x| < R’. Then 
co 


F: nanx”™! is absolutely convergent and 


lana | = Mant" 1/2 4 < |æ||jnans" t], forn =1,2,.... 


co 


Therefore, by the Comparison Test, y |a,x"| is convergent for all 


n=l 
|x| < R'. Thus R > R’, so we deduce that R' = R. 


oO 


Differentiating the terms of 5 nanx”™!, we deduce that 


n=1 
OO 
D n(n —1)anx"~? also has radius of convergence R, (11) 
n=2 


by an analogous argument to the one above. 


OO 
Let f(z) = a,x”. We now use statement (11) to prove that f’ exists 


n=0 


and has the required form on (—R, R). 


Take x € (—R, R), and choose r such that |x| < r < R. Then, for all h 
such that |x + h| < r (see Figure 14), 


_ = an (a +h)” nz” 
Het HE) naga! = yo AEN" E E. naga”! 
n=1 n=1 n=l n 
x+h)” — r” — neth 
=y a aes s (12) 
n=2 
|z + h| 
-R 0 j| r R 


Figure 14 The point |x + h| 


Now we apply Taylor’s Theorem to the function p(x) = x” on an open 
interval containing x and x +h. We obtain 


1 
pla +h) = (@ +h)” =a" + nah + nln — Len Ph’, 


where cp lies between x and x + h. 
®. By Taylor’s Theorem, we have 
p(z + h) = p(x) + p'(x)((a@ +h) — z) + Rı(2), 
where p'(x) = nx"! and the remainder term Rj(a) depends on the second 
derivative p’(r) = n(n —1)2"-?. # 
Then |cn| < r, so 
|(£ +h)” =2" —ne"" "hl < 5n(n — 10 aml Fae (13) 


By equation (12) and inequality (13), together with the Triangle 
Inequality, we obtain 


Het FO) nage"! < gh So nln- Danje”. (14) 


n=1 n=2 


(oe) 
Since r < R, the series `> n(n —1)ayr”~? is absolutely convergent, by 
n=2 


statement (11). Thus, by inequality (14) and the Limit Inequality Rule, 


F(a) = Yim PEt IO) _ 


(oe) 

) nant” |, for |x| < R. 
h>0 h 
= 
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We can now easily obtain the Integration Rule from the Differentiation 
Rule. 


Theorem F72 Integration Rule for Taylor series 


The Taylor series 


f(z) = ae —a)” and F(x) = ss 
w—O) A 


=0 


an 


L 
o a) 


have the same radius of convergence, R say. 
Also, if R > 0, then 


Jro dz = (x), for e = a) < fh, 


Remarks 


1. As in the Differentiation Rule, the two series in the theorem may behave 
differently at the endpoints of their respective intervals of convergence. 


2. The final conclusion says that F' is a primitive of f on (a — R,a + R). It 
is sometimes expressed in the following way: 


J (Sae — 0) dx = is ale — arr’: 


We find c by putting x = a into the equation. 


Proof of the Integration Rule The two series have the same radius 
of convergence, by the Differentiation Rule applied to F. 


By the same rule, F’ = f, so F is a primitive of f on (a— R,a + R). Oo 


Worked Exercise F49 


Find the Taylor series at 0 for f(x) = tan™! z. 
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Exercise F70 


Find the Taylor series at 0 for: 
(a) f(w)=(1-2)% (b) f(x) = tanh“? z. 
Hint: You may find the solution to Exercise F67(b) helpful in part (a). 


Exercise F71 


Determine the Taylor series at 0 for the function f(x) = e~*”. Deduce that 


1 n 
= i, 4 (i) 
Bes Ve Se as as 
f oo o pm 


Exercise F72 


Use the Taylor series for 1/(1 + x) at 0 to determine the Taylor series for 
the function g(x) = log(1 + x) at the same point, and state a range of 
validity for this series. 


In Exercise F72 you established that the Taylor series at 0 for the function 
f(x) = log(1 + x) is valid on the interval (—1,1). In Section 2 we proved 
that this Taylor series is valid on [0,1], so by combining these results we 
see that the full range of validity for the Taylor series at 0 for the function 
f(x) = log(1+ x) is the interval (—1, 1], as stated in Theorem F65(e). 
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4.3 The General Binomial Theorem and the 
Uniqueness Theorem 


In Subsection 3.4 of Unit D1 Numbers you met the Binomial Theorem, 
which states that, for each positive integer n, 


(1+) = = (i) at, 


k=0 
where 
n\ n\ n! o n(n=-1) -e (n—-k+1) 
(6) = 1 and C) = k(n — k)! = —_ EL? for k € N. 


This gives the Taylor series for (1 + x)” and is valid for all x € R. In fact, 
a similar result known as the General Binomial Theorem holds for more 
general powers of (1 + x), but with a restriction on the values of x for 
which it is valid. 


Theorem F73 General Binomial Theorem 


For a € R, 
GE = = (*) go, tora) <i, 
n=0 
where 
(5) =i armel (s) = A for n € N. 
Remarks 


1. The Binomial Theorem is usually stated as a sum of powers of x*. We 
have stated the General Binomial Theorem as a sum of powers of x” in 
order to match the Taylor series in the rest of the unit. Note that this 
means that the role of n in the generalised binomial coefficient is the 
same as the role of k in the normal binomial coefficient. 


2. The coefficient (*) is known as a generalised binomial coefficient. 


An example of calculating a generalised binomial coefficient is 


(-3) CDCDCD--Ch-ney 
jegeg: E . (2n — 1) 
2n! i 


=(-1)" 


We now give the proof of the General Binomial Theorem. If you are short 
of time, then you may wish to skim read this proof. 


Proof of the General Binomial Theorem 


Let 
= a n — —a 
f(x) = 2 (s) x” and g(#)=f(x)1+z2)%, for |z| <1. 
We first note that the series f(x) = a a x” converges for |æ| < 1, as 
n=0 


proved in Exercise F64. We want to prove that g(x) = 1 and hence that 
f(z) = (1 + 2)°, for all x with |z| < 1. 


Using the Product Rule for differentiation, we differentiate the expression 
for g to obtain 


g'(x) = f(a) +2) | —af(x)(1 + x) 72! 
= (1 +2) f(x) - af (2) (1 + s). 


Now, using the Differentiation Rule, 


(1 +x) f'(x) —af(zx) = GS (*) gn! PoS 


n=l 


(15) 


RO O 2-9 (0) " 
=A E ( a) os =a (*) i, 
Leha Detd (pga) 
and 
OO 
Hence 
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We now use algebraic manipulation and the definition of the generalised 
binomial coefficient to simplfy the expression in square brackets in 
equation (16). 


w+) (,0,) +00—0) (2) =r nea lo— nt lo 9) 


+(n-a) B 


_ &la-1)- (a-n+1) 
n! 


= 0; 


Working backwards, it follows from equation (16) that 
(1+ x)f'(x)—af(x)= 0, and then from equation (15) that g'(x) = 0. 


So, g(x) is a constant. Hence 


g(a) = g(0) = F01 +07" 


as required. E 


Worked Exercise F50 


Use the General Binomial Theorem to find the first three non-zero terms 
in the Taylor series at 0 for the function f(x) = (1+ 2zx)~®. 
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Exercise F73 


Use the General Binomial Theorem to find the first three non-zero terms 
in the Taylor series at 0 for the function f(x) = (1 + 4r)". 


We now have a variety of techniques for finding Taylor series: 
e Taylor’s Theorem 

e the Combination Rules 

e the Product Rule 

e the Differentiation and Integration Rules 

e the General Binomial Theorem. 


But how do we know that these different techniques will always give us the 
same expression for the Taylor series of a given function? To end this 
section we prove a result which states that there is only one Taylor series 
for a function f at a given point a. Thus any valid method gives the same 
Taylor coefficients. 


Theorem F74 Uniqueness Theorem for Taylor series 
If 


co (oe) 
Sele —a)"= Sy bale — a)”, for |z—a| < R, 
n=0 p=) 


HNO Gp = Op. fonn =O, I, cone 


Proof Let 
fas S iol —a)” and g(x)= Shale —a)”, for |x —al| < R. 
n=0 n=0 
If we differentiate both equations n times using the Differentiation Rule, 
and put x = a, then we obtain 
f(a) = (n)an and g(a) = (n!)bn- 


Since f(x) = g(x) for |x — a| < R, it follows that f™ (a) = g(a). 
Hence an = bn, for all n = 0,1,2,.... a 
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5 Numerical estimates for 7 


One of the problems that has fascinated mathematicians for thousands of 
years has been how to determine accurately various important irrational 
numbers such as V2, 7 and e. In this section you will see the role of Taylor 
series in the numerical estimation of 7, and meet an ingenious proof that 7 
is irrational. 


Historical estimates for 7 


The use of the symbol m to denote the ratio of the circumference of a 
circle to its diameter is relatively new, being first introduced by 
William Jones in 1706. Its use was popularised by Leonhard Euler 
who employed it in his Introductio in Analysin Infinitorum of 1748. 


Early estimates for m include the following: 


Mesopotamia (c.2000 BCE) In 1936 a Babylonian clay tablet was 
excavated which gives the ratio of the perimeter of a regular hexagon 
to the circumference of the circumscribed circle as = J To (the 
Babylonians used the sexagesimal (base 60) system rather than the 
decimal system). This corresponds to a value for 7 of 3.125. 


Egypt (c.1900 BCE) The Egyptians found by experience that they 
could approximate the area of a circle with diameter d by reducing d 
by one-ninth and squaring it. This corresponds to a value for 7 


of 28 ~ 3.1605. 


Archimedes (c.250 BCE) Archimedes (c.287—c.212 BCE) considered 
hexagons inside and outside a circle and compared their perimeters 
with the circumference of the circle. This gave him a value for m 
between 3 and 3.464. He then replaced the hexagon with a 12-sided 
polygon and recalculated the lengths. He continued by progressively 
doubling the sides of the polygon until he reached a polygon of 96 
sides. This gave him a value for 7 between 320 and 34, or 

3.14084 < m < 3.14286, which is correct to two decimal places. 
Archimedes’ method was described in some detail in Subsection 5.2 of 
Unit D2. 


China (c.100—500) Zhang Heng (78-139) considered the ratio of the 
area of a square to the area of a circle and the ratio of the volume of a 
cube to the volume of a sphere, and was led to an approximation for 7 
of v10 (~ 3.162). He also calculated 7 as $38 (= 3.1724). Liu Hui 
(c.220-c.280) used a polygonal method similar to that of Archimedes. 
He doubled the sides of a regular polygon until he reached a polygon 
with 3072 sides, from which he calculated a value for m of 3.14159. 

Zu Chongzhi (429-500) deduced that 3.1415926 < m < 3.1415927, 
which was the most accurate approximation for m for almost a 
millennium. It is not known how he obtained this result but it is 
possible that he considered polygons with 24576 (= 213 x 3) sides. 


India (499) Aryabhata (476-550) in his Aryabhatiya included the 
following statement: ‘Add 4 to 100, multiply by 8 and add 62000. 
The result is approximately the circumference of a circle of which the 
diameter is 20000.’ This gives an approximate value for m of 3.1416. 
Although Aryabhata does not say how this value was found, it is 
likely that it was done by the method of polygons, using polygons 
with 384 sides. 


With the development of calculus in the seventeenth century, new formulas 
for estimating 7 were discovered, including Wallis’ Formula which you met 
in Subsection 3.2 of Unit F3 Integration: 


2 1338557 2—1 2+1 


5.1 Tangent formulas 


In Worked Exercise F49 you saw that the Taylor series at 0 for the 
function tan™! is 


tan ta2=-2 +5 = He, for x € [-1,1]. 
In particular, with z = 1, 

my 1,1 1, 

4 3 5 7 


The first of these two series is known as Leibniz’s series and the second as 
Gregory’s series, although these names are often interchanged. 


The second series is not very useful for calculating 7, as its successive 
partial sums converge far too slowly. The smaller the value of x, the faster 
the Taylor series for tan~! x converges, so fewer terms are needed to 
calculate its sum to a given accuracy. 


To obtain series that are more effective for calculating 7, we can use an 
addition formula for tan~!. To derive this formula, first recall the addition 
formula for tan (given in the module Handbook): 


tana + tanb 
t b) = ———. 
mala 1 — tana tanb 


If we now put x = tana and y = tanb, we have 


tan(a + b) = -a 
1 — zy 
so it follows that 
tan-t2+tan ty = tan”! (- ty ), for x,y E R, (17) 


provided that tan~'x+ tan! y lies in (—1/2,7/2), the image set of tan7!. 
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For example, applying formula (17) with z = i and y = z, we obtain 


tan”! ($) + tan” '(4) =tan +1 = 7/4. 


Using the addition formula (17) with small values of x combined with the 
Taylor series for tan~! at 0 gives efficient ways of calculating m. For 
example, repeated application of the formula gives the following (we omit 
the details): 
=i ft -1/1 
4tan™' (E) — tan" (355) = 7/4, 
si al =t =I f 1 
6tan™ ($) +2tan™ ($) +tan™ (z5) = 7/4. 


The first of these formulas is called Machin’s Formula. John Machin 
(1680-1751) used the formula to calculate the first 100 decimal places of 7, 
and in 1974 such formulas were used to calculate 7 to a million decimal 
places. More recently, highly ingenious methods (based on techniques due 
to Gauss for evaluating integrals approximately) have been used to 
calculate 7 correct to many billions of decimal places. 


For your interest, we now list the first 1000 decimal places of r. 


3.1415926535 8979323846 2643383279 5028841971 6939937510 
5820974944 5923078164 0628620899 8628034825 3421170679 
8214808651 3282306647 0938446095 5058223172 5359408128 
4811174502 8410270193 8521105559 6446229489 5493038196 
4428810975 6659334461 2847564823 3786783165 2712019091 
4564856692 3460348610 4543266482 1339360726 0249141273 
7245870066 0631558817 4881520920 9628292540 9171536436 
7892590360 0113305305 4882046652 1384146951 9415116094 
3305727036 5759591953 0921861173 8193261179 3105118548 
0744623799 6274956735 1885752724 8912279381 8301194912 
9833673362 4406566430 8602139494 6395224737 1907021798 
6094370277 0539217176 2931767523 8467481846 7669405132 
0005681271 4526356082 7785771342 7577896091 7363717872 
1468440901 2249534301 4654958537 1050792279 6892589235 
4201995611 2129021960 8640344181 5981362977 4771309960 
5187072113 4999999837 2978049951 0597317328 1609631859 
5024459455 3469083026 4252230825 3344685035 2619311881 
7101000313 7838752886 5875332083 8142061717 7669147303 
5982534904 2875546873 1159562863 8823537875 9375195778 
1857780532 1712268066 1300192787 6611195909 2164201989 


There are several mnemonics that can be used to recall the first few digits 
of 7, with the word lengths giving the successive digits of 7. For example, 
one such mnemonic is: 


May I have a large container of coffee? 
3. 1 4 1°45 9 2 6 
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5.2 Proof that ~ is irrational (optional) 


We finish the unit with an interesting proof which uses several ideas that 
you have met in earlier analysis units. You began your study of analysis in 
this module with Unit D1 where you first considered rational and irrational 
numbers, which together make up the real numbers whose properties lie at 
the foundation of analysis. Now we come full circle and use some of the 
tools of analysis that you have learned to prove that 7 is irrational. 


The first proof that 7 is irrational was given by Johann Heinrich Lambert 
in 1766, but the elegant, shorter proof that we give here was found by 
Ivan Niven in 1947. 


Theorem F75 


The number 7 is irrational. 


Proof We prove that 7? is irrational, from which it follows that 7 is 
irrational. The proof is by contradiction and the method is rather unusual, 
so we begin by outlining the two major steps. 


First we show that if f is any polynomial function such that 


0< f"(2) +a f(z) <1, for0<2<1, (18) 
then 
0< f(0) + f(1) < - (19) 


Next we show that if t? = a/b, for a,b € N, then there is a polynomial 
function f such that statement (18) is true but inequalities (19) do not 
hold. This contradiction shows that m? must be irrational. 


Let f be a polynomial function satisfying statement (18), and put 
g(x) = f'(x) sin rg — r f(x) cosrz. 
Then 
g'(x) = f" (x) sina + f (xyr cosrz — r f'(x) coste +n’ f(x) sin rx 
= (f"(x) + T? f(x)) sin mz. 


By the Mean Value Theorem (Theorem F37 in Unit F2), there exists 
c € (0,1) such that 


g(1) — g(0) = g'(c) = (F” (©) + 7° f (0) sin zc. 


Hence 0 < g(1) — g(0) < 1, by statement (18) and the fact that 
0 < sin7rce < 1. But 


g(1) — g(0) = x(F(0) + f0)), 
by the definition of g, so statement (19) follows. 


5 Numerical estimates for m 


249 


Unit F4 Power series 


250 


Now suppose that 7? = a/b, where a,b € N. Take N € N so large that 
naN /N! <1, 


®. This is possible because (a"/n!) is a basic null sequence: see 
Theorem D7 in Unit D2. @ 


and put 
1 
pla) = 2% (=a) (20) 
1 
= WI (eya + enqiz’*} a conx?), 


where the coefficients c, are integers, for N < k < 2N. Then we have 
0< pe; 1/N!, fr0<zrz<1, (21) 
by equation (20). Also, for k = 0,1,..., 


m 0, 0<k<N,k>2N, 
p?(0) = f 
Ek! ON <R<ON, 
N! 


so p™® (0) is an integer. Hence p)(1) is also an integer, by the symmetry 
of the function p under the change of variable x’ = 1 — x. 


Now consider the polynomial function 

f(z) = a p(2) — aX~*bp® (2) + --- + (-1)NoNpON) (2), 
which has degree 2N. Then f(0) and f(1) are both integers, so 
statement (19) is false, since we know that 7 > 1. Finally, 

f"(a) = aNp) (x) =a "bp (a) +--+ + (—1)% Tab tp) (2), 


(2N+2) 


since p = 0. Hence 


f"(a) +9 f(a) = f") +F f(@) = Fa ple) = a pla), 


by telescopic cancellation. Thus, by statement (21) and our choice of N, 
0< f"(x) +a? f(a) < na /N!<1, for0<2#<1, 


so statement (18) does hold. This completes the proof. E 


Learning outcomes 


Summary 


In this unit you have seen that many functions can be approximated by 
Taylor polynomials: for a function f that is n-times differentiable on an 
open interval containing the point a, the Taylor polynomial of degree n at 
a for f is the polynomial 

1 
Tala) = fla) + Fle- a) + 5 
You also saw how to estimate the accuracy of this approximation using 
Taylor’s Theorem and an upper bound for a remainder term R,(x) that 
depends on the (n + 1)th derivative of f. For a function f that has 
derivatives of all orders on an open interval, you learned that at points 
where the remainder term tends to zero as n —> oo, f can be represented 
by a convergent power series known as the Taylor series for f. 


(z-a) +-+ = (x —a)”. 


You went on to study how functions can be defined by means of power 
series, and how the Ratio Test can be used to find their radius of 
convergence. You saw how to manipulate Taylor series to obtain new series 
from old by using the Combination Rules, the Product Rule and the 
Integration and Differentiation Rules. You also met the General Binomial 
Theorem, which gives the Taylor series at 0 for the function (1 + x)® for 

a € R. Finally, you looked at how the Taylor series at 0 for tan~! x can be 
used to obtain estimates for the value of r. 


Learning outcomes 


After working through this unit, you should be able to: 


e calculate the Taylor polynomial Ta(x) at a given point a of a given 
function f 


e appreciate that in many cases T(x) gives a good approximation to f(z) 
for x near the point a 


e state and use Taylor’s Theorem 


e appreciate that a sequence of Taylor polynomials may or may not 
converge at a given point to the value of the function at that point 


e state and use certain basic Taylor series 
e state the Radius of Convergence Theorem 


e determine the radius of convergence and the interval of convergence of 
certain power series 


e state and use the Combination Rules, Product Rule, Differentiation Rule 
and Integration Rule for Taylor series 


e state and use the General Binomial Theorem 


e understand and use the Uniqueness Theorem for Taylor series. 
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Table of standard Taylor series 


Function 


Domain 


Taylor series 


sin x 


COS £ 


(1+ 2) 


sinh x 


cosh x 


l+ata?+a3+--- 


2 43 4 
ear ae 
1 g” 
Pak at 
oe xð r’ 
"=a a T" 
r? xf gê 
ara at 
— 1 
ip ) 92 
2! 
ee xð r’ 
trate tat 
2 4 6 


oe) 


-Ya 


n=0 


=0 
7 asd (—1)"22"+1 
S (2n +1)! 


oO (=I 


2n+1 


je] <1 


=] <y 1 


xLxeER 


xLxeER 


xLxeER 


lz|<1,aeER 


xLeER 


xLeER 


Ie] <1 


Solutions to exercises 


Solution to Exercise F55 
The tangent approximation to f at a is 
f(x) ~ f(a) + f'(a)(@ — a). 
(a) We have 
f(x) =e", f(2)=e? 
f(a) =e", fi(2)=e?. 
Hence the tangent approximation to f at 2 is 


e we? +e (e-— 2) = e(z- 1). 


(b) We have 
f(z)=cosz, f(0)=1 
f(z) =—-sinz, f'(0)=0. 


Hence the tangent approximation to f at 0 is 


cosg ¥1+0(¢—0) = 1. 


Solution to Exercise F56 


(a) We have 
f(@z)=e", fQ)=e 
fa)=e, fiQ)=¢ 
f''(a) = er, f"(2) = e2 
Oa) =e, fO(2) = e. 
Hence 
Ti (x) = f(2) + f'(2)(æ — 2) = e? + e° (x — 2) 
To(x) = Tı (£) + PR, So)" 


T3(x) = To(x) + 


=e + e(z- 2) + e(z- 2)” 
+e (x — 2)°. 


(b) We have 
f(x) = cosx, (0) =1 
f'(z)=-sinz, f’(0)=0 
f"(&)=-— cosx, f”(0)=-—1 
fe) x)= sing, f® 0) = 0; 


Solutions to exercises 


Hence 
Ti(x) = f(0) + f'(0)z =1 


To(x) = Ti (x) + LO =1- 42? 


T3(a) = To(x) + —— 


Solution to Exercise F57 
The Taylor polynomial of degree 4 for f at a is 


ieee Peds! og 


21 
(3) 
pw a (z-a)? + — —(#— a)". 


(a) We have 
f(x) = log(1 + x), f(0) = 


F(=) =1/(1 +2), f'() = 
Pa@j=-ll+e", f'0)=-1 
fO) =2/(1+2)?,  fOO)=2 
fO@) =-6/(1+2)*, FOO) =-6 
Hence 
Ty(x) =a — $a? + zu" — fat. 
(b) We have 
f(x)=sinz, —f(m/4) =1/v2 


fi(n/4) =1/v2 


f'(z)=—sinz, f"(r/4) = —-1/V2 

fO (a) =—-cosx f(n/4) = -1/V2 

f(x) = sing, f (a/4) = 1/2. 
Hence 
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(c) Since f(z) =14 42-52? — ir’ + 424, we 
have f(0) = 1 and 


f(z) =5-ac-$a? +25, f'(0) =5 

Paz 1-ri3, PO- 

f(x) = -1 + 62, fO =- 

pa (a) =6, f(0) =6 
Hence 


(Note that T4(x) = f(x), as you might expect since 
f is a polynomial of degree 4.) 


Solution to Exercise F58 
From Worked Exercise F41(a), we have 


L@)=2— 40 
so 

T3(0.1) = 0.1 — 0.001/6 = 0.099 83. 
Since 

sin(0.1) = 0.099 833 416... , 
we have 


| sin(0.1) — T (0.1)| = 0.099 833 416... — 0.099 83 
< 0.099 833 417 — 0.099 833 333 
= 0.000 000 084 < 1 x 1077, 


as required. 


Solution to Exercise F59 


(a) We have 
fiz)= e, f(0)=1, 
and in general, for each positive integer k, 
Hence 
2 n 
Ta (2) sltet+ Ste to. 
(b) We have 
f(x) = log(1 + x), f(0) =0 
fi(z)=1/d4+se), f'(0)=1 
f") = -1/1 +z), f"(0)=-—1 
fO) =2/ +2), fF) =2 
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and in general, for each positive integer k, 


TETE (—1)¥ti(k — 1)! 
so that 


fOO = (- eV) 
Hence 
ee E E E E ol 
T(z) =£- 5% + 3 +(-1) = 
(c) We have 
F(z) = cos, 
fi(e)=—sinz, = f'(0) =0 
f"\(ax)=-cosz, f"(0)=—-1 
f(x) = sina, 
f(x) = cosa, 
and in general, for k = 0,1,2,..., 
FO) (0) = (-1)* and f+) (0) =0. 


Hence, for m = 0,1,2,..., 


m „ wk 
k=0 

1 pt am 
ae a ae a 


and Tom4+1 (x) = Tom (T): 
So if n takes either of the values 2m or 2m + 1, 
then we have 


Tr(z) =1-—+—-.---4 


Solution to Exercise F60 


From Exercise F59(c), the nth Taylor polynomial 
at 0 for f is 


a xt gem 


Tn(x) = 1 ! oe. Gm!’ 


where n is either 2m or 2m + 1. Hence, by Taylor’s 
Theorem with a = 0, f(x) = cosx and n = 3, 


cos xz = T(x) + Rg(x) 
=1- iz’ T R(x), 
where 
(4) 
SOO a e 4 
4! 4! 
for some c between 0 and zx, as required. 


R3(x) = 


Solution to Exercise F61 


(a) By the solution to Exercise F59(b), we have 


To(x) = x — a. 


(b) We use Strategy F11 with a = 0, x = 0.02 and 
n= 2; 


1. First, (OG) = aa see Exercise F'59(b). 
2. Thus 
|f(e)| = aso <2, for c€ (0,0.2], 
so we can take M = 2. 
3. Hence 


| log (1.02) — T,(0.02)| = |R2(0.02)| 


< oa |z — a|? tt 


2 3 
an" |0.02 — 0| 
= 0.000 0026 
< 3x107, 


as required. 

(c) By part (a), 

T>(0.02) = 0.02 — (0.02)? = 0.0198. 
By part (b), 

| log (1.02) — T}(0.02)| < 0.000 003. 
Hence 

0.019 797 < log(1.02) < 0.019 803, 
sO 


log(1.02) = 0.0198 (to 4 d.p.). 


Solution to Exercise F62 


(a) Using the derivatives of f found in the 
solution to Exercise F59(c), we obtain 


f(z) = =l; f'(x) = 0, f" (x) = I; 
Om) =0, fOM = -1. 
Hence 


Ty(x) = -1+ 4(£- r) — Ale- r)". 


Solutions to exercises 


(b) We use Strategy F12 with I = [37r /4, 57/4], 
a=7,r=7/4andn=4. 
1. First, f© (x) = — sin z. 
2. Thus 
[fO(c)| <1, force [3n/4, 5/4], 
so we can take M = 1. 
3. Hence 


<3x107%, for x € [37r/4, 5/4]. 
Thus T(x) approximates f(x) with an error less 
than 3 x 1073 on [37/4, 57/4]. 


Solution to Exercise F63 
(a) Here a, = 2" + 4”, for n =0,1,2,..., so 
gnt+1 antl 

an pa 4n 


Dividing the numerator and denominator by 4” 
gives 


An+1 
an 


9 n 
== 2(1/2)" +4 —> 4 as n —> CO. 
(1/2)" +1 

Hence, by the Ratio Test, the radius of 
convergence is R = i: 
(b) Here an = (n!)?/(2n)!, for n =1,2,..., so 
((n+1)!)? — (2n)! 
W—_— x 

(2n+2)! (n)? 

(n+ 1)(n +1) 
(2n + 2)(2n +1) 


Dividing the numerator and denominator by n? 


An+1 
an 


An+1 
an 


gives 

|= UEU OEL EL > s as n > œ. 
an (2+2/n)\(2+1/n) 4 

Hence, by the Ratio Test, the radius of 

convergence is R = 4. 


(c) Here an = n +27”, for n = 0,1,2,..., so 


An+1 —ntl4+2-1 
an | n +27” 
1+1 1/(n2"t1 
— Le lfn sl/r") ss asin 00. 
1 + 1/(n2”) 
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Hence, by the Ratio Test, the radius of which is again divergent, by the Non-null Test. 
convergence is = 1. Hence the interval of convergence is (—1, 1). 
= n = 
(dy, Here ay =n", for m= 1,2) 1405 80 (b) Here an = (—1)"/(n3"), for n =1,2,.... 
antij C ii 1. Since 
an n” An+1 1 n3” 
pay” a ~ (rt hart T 
=(n+1) — co as n —> oœ. g i i 
n 
= ———_ > -an> oo, 
Hence, by the Ratio Test, the radius of (1+ 1/n)3 3 
convergence is R = 0; that is, the series converges we have R = 3, by the Ratio Test. Since a = 5, 
only for x = 0. this power series 


e converges for 2 < x < 8, 


Solution to Exercise F64 é-diverves fot 2 Send o 22. 


Applying the Ratio Test with 2. If x = 8, then the power series is 
_ le 1a ne) œ (—1)” = (1) 
m = ni 5 for n EN, ip a G- =3 1 
n=1 n=l 


we obram Tore 0,1 Auei which is convergent, by the Alternating Test. 


An+1} _ ao — 1): (a-n) 7 n! If x = 2, then the power series is 
an a(a—1)---(a-n+1) (n+1)! 00 ar 
n+1 n=1 
(a/n) —1 which is a basic Poesia series. 
= [|-> | > lasn- oo. . . 
14+1/n Hence the interval of convergence is (2, 8]. 


Hence the radius of convergence is 1. p . 
Solution to Exercise F66 


Solution to Exercise F65 (a) We know that, for x € R, 
In each case, we apply Strategy F13. p. 98 an 
(a) Here a, = n, for n = 1,2,.... Pa=ltets pe ae Ea 
1. Since and 
m i eta 24% Fy ean 
=1+1/n > 1 asn > oo, Hence, by the Sum and Multiple Rules, 


we have R = 1, by the Ratio Test. Thus this 


: E pe =y 
; sinh x = 5(e” — e`”) 
power series 


F g2n+1 
e converges for —1 < x < 1, spri pheta ae 
e diverges for x > 1 and x < —1. 3! (2n +1)! 
2. If x = 1, then the power series is for x € R. 
5 n(1)" — er (b) We know that, for |x| < 1, 
n=0 n=0 path y= r r’ r” 
which is divergent, by the Non-null Test. “= ee 3 n 
If x = —1, then the power series is and 
OO co 1 
n n = 2 O Mosan 
2 = CD n, l-g Tatr Sie eee a a 
n= n 
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Solutions to exercises 


Hence, by the Sum and Multiple Rules, Hence, by the Product Rule, 
1 
log(1— 2) + 5 E T op eA 
= -gT 
7 ro 78 g” x (1 +3r +5r? +--+ (2n +1)” +) 
“Ae a =14+ (3+1) +(5+3+1)z? +- 
+ (2 +22 +22? +22 +- H2) (n +1) + (n= 1) eee je eee 
=144¢49074+---4+(n+1)*%ao™ +--., 


Ja Da 1 
—2 3 = m S E E 
en a +( =) ot for |z| < 1, since 1+3+4---+(2n+1) is an 
arithmetic series with sum (n + 1)?. 


for |z| < 1. 
Solution to Exercise F67 Solution to Exercise F68 
(a) We know that, for |æ| < 1, (a) We know from Exercise F66(a) and 
3 Theorem F65 that, for x € R, 
log(1+2) =2—- + —- i r 2? xenti 
A a a ry te aa a 
rE oe 3!" 5 (n+1)! 
n and 
Hence, by the Product Rule, m: gant 
] = — _ —— —— — sso — No 
(1 +2) log(1 +2) SO pig Gat 
_ (5p Hence, by the Sum Rule, for x € R, 
2 3 
š n j sinh x + sin x 
£ 
+ («?- —+---+4+(-1)” Z +) 2x5 2x9 Q7Antl 
2 n— 1 S26 + + a 
a es on 5! 9! (4n + 1)! 
= n 
=7%+ > 6 Feret (1) n(n—1) cane This Taylor series is valid for all real values of x. 
for |x| < 1. (b) We know that 
(b) We know that, for |x| < 1, _ = fgg a1) es, 
1 = 2 n : 
=" Se ea ee for |x| < 1. Replacing x by 2x7, we obtain 
H , by the Product Rule, 1 
aa y the Froduc ule ques ee ee ee ees 
—— = Cee tebe) 
(1-2) This last series converges for 2x? < 1; that is, for 
=1+(1+1)2+(1+1+4+1)2°+--- |z| < 1/v2. 
+14 l)e” Hence this Taylor series is valid in the interval 
=1+2r +32? +--+ (n+1)z”+---,  (—1/V2, 1/V2). 
Deak Solution to Exercise F69 
k hat, f 1 
(c) i now that, for |z| < 1, We know that 
a i, 2 sss ee de a 2 n 
roo pate terto e E for z €R. 
! n! 
d, for İf Worked Exercise F48, that 
i i d team) Worked Exerc: ak Also, for |x| < 1, from Exercise F67(b), 
+2 2 
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Unit F4 Power series 


Hence, by the Product Rule, 
eda) 
=(l+a2t+ga7+---)(14+2x+327+---) 
=1+(2+1)r+(3+24+5)a2°+--- 


=1+3r+ 4z? +., for |2| <1. 


Solution to Exercise F70 
(a) We know from the solution to Exercise F67(b) 
that, for || < 1, 

(1—2)? =14 294 3n? +---4+ (n+ le" +- 
Hence, by the Differentiation Rule, 

(1 — £)? =2462+---+(n+1)na™14+..., 
for |z| < 1. 
Applying the Multiple Rule, we obtain 


1 
e a a a re 
for |z| < 1. 


(b) From the table of standard derivatives in the 
Quick reference section of the module Handbook, 


l 
we have f'(x) = E for |z| < 1. Also, we know 
-gr 
that 
1 
To e e for |z| < 1. 
—2 
Replacing x by x”, we obtain 
f'(z) =14ta%tatt-.-+a™4+.--, for |z| <1. 


Thus, by the Integration Rule, the Taylor series 
at 0 for f is 


3 5 


oo 
Tetak t E 


for |x| < 1. Since f(0) = 0, it follows that c = 0. 


g2ntl 


fees, 


Hence 
arest i be piu 
7 3 5 2n +1 i 
for |z| < 1. 
Solution to Exercise F71 
Since 
r? r” 
e =1+r+> + + H+, forrER, 
2! n! 
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we have 
4 2n 
a E a ee E y ee 
e = 1 m+ 5 ga) err aaa 


for x € R. It follows from the Integration Rule that 


1 2 
1 e ®© dz 
0 


r3 x5 y2n+1 1 
= S22 eny D p ees 
E 3 75x i oa” i 
E dee a 
© 3 10 (2n + 1)n! 


Solution to Exercise F72 
We know that 


d 
= log(1+ x) = 


, 


lg 
and that, for |z| < 1, 


1 
Slog pg? are (1 r4 
Lg ae 
Hence, by the Integration Rule, 

2 n+l 

z x 

log(1 = Z jera 1)” l 

og(l +z)=c+rzr 5 + ( ema 


for |x| < 1, where c is a constant. 


On substituting x = 0, we find that c = 0. Hence 


2 


n 
log(1 + £) = £ — Dte +HZ 


+ ui 
for |z| < 1. (Note that (—1)"*! = (—1)"-1) 


Solution to Exercise F73 


By the General Binomial Theorem, 


oe) 


i 
= > ( 5) (4r)”, for |4z| < 1, 


n=0 


T E b= 
( 3 nra for n EN. 


— 44 3272 _.., 
=l- 9r+ 52 5 
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